FIGURE l 



m 

-s.- r. 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGC AGGT C T GAAGAC T AACATT T T GT GAAG T TGTAAAACAGAAAACC T GT T AGAAATGT GGTGGT 
TTCAGCAAGGCC.TCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
T C AT ACAT T AC T GCAGTAACAC TC CACCAT ATAGACCC GGC T T T AC C T TAT ATCAGT GACAC T GG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
C TACCAT T TATGT TCGTT AT AAGCAAGT T CATGC TC TGAGTCC TGAAGAGAACGT TATCAT CAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
AT C AG T T T T G C AC AG T G G C AAT T T T G G G AC T GAT T T AG AAC AGAAAC T C CAT T GGAAC C C C GAG G 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
v GAGATAT TTGATGAAAGGATAAAATAT TTCTGTAATGAT TATGAT TC TCAGGGAT TGGGGAAAGG 

1 TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
;! GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT 
CAAGAAGAC TAT T AAAAACACCTAT GC CT AT AC T T T T T TAT C T CAGAAAAT AAAGT C AAAAGAC T 
ATG 
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FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVTWTSAAFI FSYITAVTLHHIDPALPYI SDTGTVAPEKCLFGAMLNIAAV 
LCIATIWRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains : 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCT.GTGGCCCGAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
g : CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 

=SKiS 

H CATT CTCAGCCCAAAACATGGGATC TT ATCCATAGAACAGCTCATCAGC CGGGTTGGTGT GAT T GGAG 

C| TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

£H CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 

is f 

J* C ATAAG CAAAAAGAAAAGG AT G G C AAT GG CAC G GAGAAC AATG T T CCAGAAG G GGGAAGT G CAT AAC A 

, n f AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
!r CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
- ATAT GCTACCAAGGAGAGAA.TAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTT TCTTGGTT 

Q ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
111 GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT 

HI 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTAT TAGCACAGATAAT GGGCAT GTAC T T TGTCTCCTCTGTGC T GCT GAT CCGAAT GAGTATGCCTTT 
AGAATACCGCACCATAATCACTGAAGT C CTT GGAGAACTGCAGT TCAACTT CTATCACC GTTGGTTT G 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCACCT TGAACT TAAGCCTACTACAGACT GT TAGAGGCCAGTGGT TTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTT TCACCT T CATAGCATACTCC TTCCCC GTCAGGTGATACTATGACCATGAGTAGCAT CAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACT C T AGAGTC GAC C T GCAGAAG C T TGGCCGC CAT G G C CCAAC T TGT T TAT T G CAGC T TAT AAT G 



n 
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FIGURE & 

MS FL I DS S IMI T S Q I LFFG FGWLFFMRQL FKDYE I RQYWQVI FS VT FAFSCTMFEL 1 1 FE I LGV 
LNSSSRYFHWKMNLCVILLILVEMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTBMYFFWKLGDP 
FPILSPKHGILS IEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDM IIS KKKRMAMARRTMFQKGE VHNKP S G FWGM I KS VT T S AS G S ENLT L I QQE VDALEE L S RQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VLL I RMSMPLE YRT 1 1 TEVLGELQ FNFYHRWFDVI FLVSALS S I L FLYLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCAT GCAGAAT TATGGGGAT CACC C T TG TGAGCAAAAAGGCGAACCAGCAGC TGAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATT AT CACCACCAAAGAT CCCATAT T CAACAC TC AAACT GCAACACAAACAACAGAAT T TAT T GT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG 
GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
AGCTGCTGGGTfTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATC GAAAC CAAAGTAGTAAAGGAGGAGAAGGCCAAT GATAGC AACCCT AA 
T GAGGAAT CAAAGAAAAC T GATAAAAACCCAGAAGAGT GCAAGAG TCCAAGCAAAACTACCGT GC 
GATGCCT GGAAGCTGAAGT T TAGA TGAGACAGAAATGAGGAGACACACCTGAGGCTGGT TTCTTT 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCT T GGT T CC TAAC T GGAATCAGCT CAGGACTGCCAT TGGAC TATGGAGT GCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
T T CTAGCC T GGC TATG TCC TAATAATAT CCCAC T GGGAGAAAGGAGT T TTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGCCCC T GAAAGC CAT GGAGAT T C T CATAAC T TGAGACCTAATCT C T GT AAA 
GC TAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACAC TGTCAGCAGGGACTGTAAAC 
ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA 
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC T C T T AT AAAT T T C TAT T T T TAT C T GAG T T AC AGAAAT GAT T AC T AAGG AAGAT T 
ACTCAGTAATTTGTTTAAAAAGTAATAAAATTCAACAAACATTTGCTGAATAGCTACTATATGTC 
AAGT GCT G T GCAAGGTAT T ACAC TC TGTAAT TGAATAT TAT TCC T CAAAAAAT TGCACATAGT AG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TAT TAT TAACATACC T AAGAAG TACAT TGT T ACC TC T AT AT ACCAAAGCACATT T TAAAAGT GCC 
ATTAACAAATGTATCACTAGCCCTCC T TT TT CCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
TGTGACAAAAAATTAAAGCATTTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L SLAGKDQVETALKAS FET CS YGWVGDGFWI SRI S PNPKCGKNGVGVLI WKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKL I CVTEVFME TS TMS TETEPFVENKAAFKNEAAG FGGVPTALLVLALL FFGAAAGLG FC YVK 
RYVKAFP FTNKNQQKEMI E TKWKEEKANDSNPNEE SKKT DKNPEESKS PSKTT VRCLEAE V 

Signal sequence : 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 



APPJD=1()063715 



Page 160 



FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC. 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCGCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGC GCAGCGC GGT GGAAGAGAT GGAGGCAGAAGAAGCTGCT GC TA 
AAGCATCATCAGAAG T GAACC T GGCAAACT TACC TCCCAGC TAT CAC AATGAGACCAAC AC AGAC 
AC GAAGGT T G GAAATAAT ACCAT CC AT GT GCACC GAGAAAT T CACAAGAT AACC AACAACC AGAC 
TGGACAAAT GGT C T T T TCAGAGACAG T TAT CAC ATC T GT GGGAGACGAAGAAGGCAGAAGG AGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
O GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
■C| ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
p| ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
y CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 

CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
l2 GAGAT CCT GCT GCCCAGAGAGG TCCCCGAT GAG TAT GAAGT TGGCAGCT TCATGGAGGAGG T GCG 

[jjp CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
*' CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
^ TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
Cl CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
|l! CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
|*| GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
m TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
f*\ TGATTGGTTTTGGGGAT^ATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
5| CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
iy AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
C T CCAC TACCC CACACCAGCC T T GGT GCCACCAAAAGTGC T CCCCAAAAGGAAGGAGAAT GGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACC TGCAGAAAC AGTAC T T AGGTAATTGTAGGGCGAGGATT ATAAAT GAAAT T T GC 
AAAATCAC T TAGCAGCAAC T GAAGACAATT ATCAACCACG T GGAGAAAAT CAAACCGAGCAG GGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGAT TGT ATAAGCATGCT T TC T T TGAGT T T TAAAT TAT GT ATAAACAT AAGTTGCAT T TAGAA 
ATCAAGC ATAAATCACT TCAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEWLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQWFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGEPAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE 0 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTAT ATTGACCACAAGCT CAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTT T 
C AAG TGC T GC C T C TGG TGTC TGGAAAAAT T TAT CAAGT TC C T AAACC GCAAT GC AT AC AT CAT GA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
|f| GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
* AGAT T CT GGGCAAGAAGAACGAGGCGCCC CCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 

Cl CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATT T TAT TAAAGATATT TTGT TAACTC 



m 
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FIGURE 10 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQD I PT FPL I S AFIRT LRYHTGS LAFGAL I L'TL VQ IARVI L 
E Y I DHKLRGVQNPVARC IMCC FKCCLWCLEKFI KFLNRNAY IM I AI YGKNFC VSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 



Important features: 
Lju Transmembrane domains: 

CI amino acids 57-80 (type II), 110-126, 215-231, 254-274 

m 

5 ; u N-glycosylation sites. 



amino acids 16-20, 27-31, 289-293 



Ui Hypothetical YBR002c family proteins. 

amino acids 276-288 

is*;?? 

m 

IS? 5 

jp| Ammonium transporters proteins . 

fl| amino acids 204-231 

m 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Ami da ti on site. 

amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTAGAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
k ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
I CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
)■ GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
i TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
I CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
f CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
I TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGGTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
| TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
! AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
I GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
I ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTAGAAGCCCGGTGAG 
I ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGC TGA GGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAZ^AAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 

YKL PWVCE E GAG I PT VLQGH I DCGSLLG YRAVYRMC FAT AAFFFFFFT LLMLC VS S SRDPRAAI Q 

NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 

ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 

QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETVVAGPEGYETQWWDAPSI 

VGLI I FLLCTLFI S LRSS DHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVT YS Y 

SFFHFCLVIAJ3LHVMMTLTNWYKPGETR 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains : 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GG T TGGAAAAAGAC T CC T G TAACCC TCC T GCAGGATGAACCACC T GC CAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGA^AGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAAT G GAGGC AT TGAGAACACAT T AGAGAAGGAGGT GAT GCAGTAT G AC T AC TAT T C T TC ATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
m ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
O TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
W C T TGCC T GGAT TGAGACG T GGT TCC T GGAT T T CAAAGT GT T ACC T CAAGAAGCAGAAGAAGAAAA 

Or 

7\ CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 

GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT 
GAGAAACCACT T T TAGAAC TATGAGTAC TACT T T TGT TAAATGTGAAAAAC C C TCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CAC TGCT GGCT T TAT T GAACAGC TAAT AAAGAT T TAT TT AT TGTAATACC T CACAAAC GT T GT AC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
n TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
f)i| AAAAG AC T T AA T A T A T T G AAG T AAC AC TTTTTTAG T AAG C AAG AT AC CTTTTTATTT C AAT T CAC 
O AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 

m 

- V TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 

ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACT TAAACCAT TCATATCATGTT TCCTTTGCGT TCAGCCAATT TCAAT TAAAAT GAA 
C T AAAT T AAAAA 



•r,r ;: 



APP_ID=10063715 Page 167 of 322 



FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNTOGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLIIAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myr i s toyl a t i on si tes . 

amino acids 14-20, 78-84, 79-35, 202-208, 217-223 
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FIGURE is 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCeCTGGGGCCTGGGGTGCAGGGC'TGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGG 
s . CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
CI TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
p CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
ifl TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
f I TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
N CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
|4 CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
Iff TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
CI GGCC T GTAC T G TGAGAGC CAGATGGGGCAGGGGACACGGCC CAGC CC TACACCAG T CAC GC C GAG 

Iff GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
p% AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
^ GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
Cl GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
fl| CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTQAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC T GT GT GAC T C TAGTC T T GGCCCCAGGAAGC GAAGGAACAAAAGAAAC T GGAAAGGAAGATGC 
T T TAG GAAC AT GTTTTGCTTTTT T AAAAT AT AT AT A T T T AT AAGAG AT C C T T T C C CAT T TAT T C T 
GGGAAGAT G TT T T TCAAACTCAGAGACAAGGAC T T T GGT T T T TGTAAGACAAACGATGATAT GAA 
GGCC T T T T GTAAGAAAAAATAAAAGAT GAAGT GT GAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPIANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRIjAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNL S LQALPGDLSGL FPRLRLLAAARNP FNCVCPLS W FGPWVRE S HVT LAS PEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTYGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LIAAIJ^AVGAAYCWRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQS PLHAKPY I 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N -myr i s toy 1 a ti on s i tes . 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



II! 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAG'TCCGTGGTGGCAGAGGCGAAGGCGACAGCTCMGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGAT GAATCC T TAGAT TC CAAGACTACT T TGACAT CAGAT GAG TCAG TA 
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
AT T AGAAT C C T C TAT T CAAGAAGAGGAAGACAG C C T CAAG AG C CAAGAG G G GGAAAG T G T C AC AG 
AAGATATCAGC TT TCTAGAGTC T CCAAAT CCAGAAAACAAGGAC TATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T TCC T AGATAAGGAG TAT GATGAATG TACATCAGAT GGGAGGGAAGAT GGCAGACTGT GGT GTG 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG 
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
. AATAATAT TAACATC AGAAG AAT T T G T G GT T TATAGC GGC CACAAC T T T T T CAGC T T TCATG AT C 
CAGAT T T GC T TG TAT T AAGACCAAATAT T C AGT TGAAC T TCC T TCAAAT T CT T GT TAATGGATAT 
AACACAT GGAAT C TACATGTAAATGAAAG T TGG TGGAGT CCACAAT T T T TC T T TAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT TGTCAGAATCAT T T T T TACAT TAGAT TATCATAATT T TAAAAAT T T T T C T T TAGTT T T T CA 
AAAT T T TGTAAATGGT GGCTAT AGAAAAACAACATGAAAT AT TATACAAT AT TT TGCAACAATGC 
C C T AAGAAT T GT TAAAAT T CAT GGAGT T AT T TG T GC AGAAT GAC TC CAGAG AGC TC T AC T T T C T G 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCT7\ATTTGG 
AAGAT TAAC T CAT T T T T AATAAAAT TATG TC TAAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRVRI GLT LLLCAVLLS LASAS S DEEGS QDE SLDSKTTLT S DE S VKDHT TAGRWAGQI FLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGRE DGRLWCATT YDYKADEKWGFCETEEEAAKRRQMQEAEMMYQT GMK I LNGSNKKS QKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVYYT FGALGGNLI AHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites . 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 2 99-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE iq 

AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTAT ACAAGAAAGGAG TGTACCTATCACACACAGGGGGAAAAATGC TC T T T T GGGT GCTAGG 

CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGT ACATT T T TAT CAC T GGATGTGACT C GGGC TT T GGAAAC T TGGCAGCCAGAACT TT T GAT 
AAAAAGGGAT T TCATGTAATCGC TGCCTGTCTGAC TGAATCAGGATCAACAGC TT TAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGT GC T GGC T CCCAC TGAC TGGC T GACAC TAGAGGAC TACAGAGAACC TAT TGAAGT GAA 
CC TGT T T GGAC TCATC AG T G T GACAC TAAATATGC T T C CT T T GG TCAAGAAAGC TCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
ATGCATT GAACCAGGAT TGT TCAAAACAAACT TGGCAGATCCAGTAAAGGTAAT TGAAAAAAAAC 
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGTCTAGACAAAGTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTC T TATCTAAAT AT TAAAAGATAAGT CAAC CCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENWRTAQWVKNQVGEKGLWGLI]MAGVPGVLAPTDWLTLEDY 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKIAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVHMD 
LSPWECMDH7U.TSLFPKTHYAAGKDAKIFW.IPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGftGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCfiGr.fi 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTAC TGATTCCCAAATGGATGAT GT TGAAGTTGT TTATACAATTGA 
CAT TCAGAAATATATT CCATGCTATCAGCTT TT TAGCT TT TATAATTCT TCAGGCGAAGT AAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCAT TCAGAT CAGATCAT GACGT TT AGAGAGAGGC T GC T T CACAAAAAC T TGCAGGAGCAT T T 
T T CAAACCAAGACC T TGTTTTTCT GCT AT T AAC ACCAAGT ATAATAAC AGAAAGC T GC T C TAC TC 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
O T T TTAGCCGAGCAGTACAAACACACAGC TCTAAATTT TTTGAAGAAGATGGATCCT TAAAGGAGG 

*? I TACATAAGATAAATGAAAT GTATGCTTCAT TACAAGAGGAATTAAAGAGTATATGCAAAAAAGTG 

fl GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGAT TAAAACGAGAAAT TGA 

fa GAAAAGGAGAGGAGCACAGAT T CAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCC TCAGGAGA 

|J1 ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
* ATGT C T TT AAAAAATAGACATGT T TCTAAAAGTAGC TG TAAC TACAACCACCATC TCGAT GT AGT 

AGACAATC T GACC T T AATGGTAGAACACACT GAC AT T CCT GAAGC TAG TCCAGCTAG TACACCAC 
AAATC AT T AAGCATAAAGC C T T AGAC T TAGATGACAGAT GGCAAT TCAAGAGAT C TCGGT T GT TA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 
C TACAT T T TGAT C C T T T T AAC C T TACAAGGAGAT TTTTTTATTTGGCT GAT G GG T AAAGCC AAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGT T TGCAGTAATACACAGATAACT CT TAGT GCAT T TACT TCACAAAGT AC T T T T TCAAACAT CA 
GATGCT T T TAT T TCC AAACC T t T T T T T CACC T T T CACTAAGT T GT T GAGGGGAAGGC T T AC ACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGT CAGGAGGAT TGAT T GAAGC TAGG AGT TAGAGACCAGCC TGGGCAAC G TAT T 
GAGACCAT G TC TAT TAAAAAATAAAAT GGAAAAGCAAGAATAGCC TT AT T T TCAAAATATGGAAA 
GAAAT T TATATGAAAAT T T ATC T GAGT CAT TAAAATT C T C C T TAAGTGAT AC T T T T T T AGAAGTA 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGE S T S AVLSGFVLGALAFQHLNTDS DTEG FLLGE WGEAKNS I TDSQMDDVEWYT I D I QKY I 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLT PS 1 1 TE S C S THRLEHSL YKPQKGL FHRVPL WANLGMS EQLGYKT VS GS CMS TGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q I QAAREKN I QKDPQENI FLCQALRT FFPNSE FLHS C VMS LKNRHVS KS SCNYNHHLD WDNL T L 
MVEHT D I PEAS PAST PQI I KHKALDLDDRWQ FKRSRLLDT QDKRS KANT GS SNQDKAS KMS S PE T 
DEEIEKMKGFGEYSRSPTF 

B • 

Qs Important features: 

Signal peptide: 

amino acids 1-19 

N-glycosylation sites . 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT GGAGGAGAAGTACAGC CT GCAAAGTAAAGAT GATT T CAAAAGT GT GGT CAGCGAACAGT GCAAT CATT T G 
CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AAT GATAT GTT T GTGAAGACAT AT GGCCATT T ATACAT GCAAAATT CT GAG CTATT TAAAGAT CTCT T C GTAGAGTTG 
Cf! AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
Ul TT C CGCCT GGT GAACT CCCAGTACCACT TTACAGAT GAGTATCT GGAAT GT GTGAGCAAGTATACGGAGCAGCT GAAG 

CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
iff ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
S TGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAACAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTA 

GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
■3 TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
|!l AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 

GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTTT GCAGT GACAGGAAAT GGAT TAG CCAAC CAGGGCAACAACC CAGAGGT CC AGGTTGACACCAGCAAACCAGAC 
ATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGATGAAGAATGCATACAATGGGAACGACGTG 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTACCATCCTAGTGACTTTGCTTTTTAAATGAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCT GAAATATTAAATAGCTGTACAGAAGCAGGTTTTATTTATCAT GTTATCTTATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLW 

CCS QEMEEKYS LQSKDDFKS WS EQCNHLQAVFAS RYKKFDE FFKELLENAEKS LNDMFVKT YGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGIAVAGDWSKVSVVNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCIANQGDLDFEWNNFIDAMLMVAERLEGPFNIESWDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSIWCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 2* 

C T CGCC C T CAAATGGGAACGC TGGCC TGGGACTAAAGCATAGACCACCAGGCTGAGTAT CCT GAC 
C TGAGT CAT CCCCAGGGATCAGGAGC CTCCAGCAGGGAACC T TCCAT TATAT TC TTCAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGC CCAC T C T T CCAAT T AAAC AT T CT CAGC CAAGAAGACAG T GAGC ACACC TAC CAGACAC TC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGC TAGT G T CAT T TAACC T TAAATGCAAT CAGGAAAGTAGCAAACAGAAG T C AATAAATAT T T T T 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 

MKVLISSLLLLLPI^mSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 



N-myristoylation sites. 

amino acids 27-33, 46-52 



OS 
hi 

SI 

u 



,S-.;5, 
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FIGURE 27 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 
tnl CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 
S| TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 
ffl GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 

W GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 

M 

GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCGCAAGCTCCAAGGCACTCATTTGCTCC 

TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
" ? " GCAGAATGAGAGAAGACAT TCATG TAC CAC C TAC TAGT CCC T C T C TCCC CAACCT C TGCCAGGGC 

Q AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
III AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
Jjj CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
pj CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
ifj CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 

ATGCCAGAGCAAGAC T CAAAGAGGCAGAGGT TT T GT TC TCAAATATT T T TTAATAAATAGAC GAA 

ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALL^ 

TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQP I YL PLRGT 

Important features : 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern. 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCGAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGT TGT T TATGAT TC TTATAC TAAT T TATACAAAGAT AT TAAGGCCC TGT TCAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLSWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 

GTTTGAATTCCT TCAACTATACCCACAGTCCAAAAGCAGACT CAC TGTGTCCCAGGCTACCAGT T 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAAGGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGGTTTTCCAAATCAC 
AAATAAAG C CATCAGCAG TGCTCCCTTCCT GC T GT TCCAGC CAC T G T GGAC AT T TGCC AT CC T CA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AACCAGAAT GCATATACTACAAC T GC TAT T AATGGGACAGAT T T C TG TACAT CAGCAAAAGAT GC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAAC TCCAGGCCAT TGTGAGATAGATACCCAT TTAGGTATCTGTACC TGGAAAACAT T 
T C C TTCT AAGAGCCAT T TACAGAATAGAAGATGAGACCACT AGAGAAAAGT TAGT GAAT T TT TTT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
I L I FFWVLWVAVLL S LGTAGAAQVMEGGQVE YKPL S G I RYMWS YHL I GL I WT S E FI LACQQMT IA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTWKGSFLISWRIPRIIVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLE TNDGSSEKP YFMDQE FLS FVKRSNKLNNARAQQDKHSLRNEE GTELQAI VR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains : 

amino acids 35-54 , 75-97, 126-146, 185-204, 333-350, 352-371 
N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 
N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE aa 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTT ATGA GGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAG T TCACTGT GC C TCAGATCAAC TGC 
GATGT CAAAGCC GGAAAGAT CAT CGAT C CT GAGT TCAT TGT GAAATGT CCAGCAGGAT GC CAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGT GC T T GAT AAT T CAGGAGGGAAAATACT T GT TC GGAAGG T T GC T GGACAGT C T 
GGT TACAAAGGGAGT TAT T CCAACGGT G TC CAAT CGT TAT C CC TACCACGAT GGAGAGAAT C C T T 
TATCGT C T T AGAAAGTAAACCCAAAAAGGGT G TAACCTACC CATCAGC TCT TACATAC T CAT CAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
y s ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
£ B t CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
y GCCACAGGAGCCAGGAGAT GGATC TCT GGTCCAC TGCCACCTACACAAGCAGCCAAAACAGGCCC 

0 AGAGC T GAT CCAGGTAT CCAAAGGCAAGAT CC T T CAGGAGC T GC C T TC CAGAAACC TGT T GGAGC 

y| GGAT GT CAGCC TGGGAC TTGT TC CAAAAGAAGAAT TGAGCACACAG TC T T T GGAGCCAG TAT C CC 

h| TGGGAGATCCAAACTGCA7VAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
. CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
/f CGGT CCACT GATGGGT GT T G TCCAGTATGGAGACAAC CCT GC TAC TCACT T TAAC C TCAAGACAC 

"M* ACACGAATTCTCGAGAT CT GAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACTT TCTAAT 

Iff GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
m GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
m GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
W AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
U'i GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
C| ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
fA GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
P- AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
Jjf T GGAGT T T GGGT TCGACAAGTACAGCAGCAAGCC TGACATC CTCAACGCCAT CAAGAGGGTGGGC 

111 TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

G TCCAAGCCCAACAAGAGGAAGT TAAT GAT CC TCATCACC GACGGGAGGT C C TACGAC GACG T CC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
G T T TGAC AACC T CCAT CAGTATGTCCCCAGGAT CATCCAGAACAT T TGTACAGAG T TCAACT CAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAAT GT C TTGT TAT TAT TCT T TGCCAT CAT GCT T TT T CATAT TCCAAAAC T T GGAG T TAC AAAGA 
TGATCACAAACGTATAGAATGAGCCAAAAGGCTACAT CAT GT TGAGGG T GCT GGAGAT T T TACAT 
TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG 
AGCT T T TGT GAGAT T T T TAAGT T GT TATT T C TGAT T T GAAC TCTGTAACCC TCAGCAAGT T T CAT 
TT TTGT CAT GACAATGTAGGAAT TGCTGAAT TAAATGTT TAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 3 4 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
T PT TLPRPS PS AAS TT S I PRPQS VGHRS QEMDLWS TA/T YTS SQNRPRADPG I QRQDP S GAAFQKP 
VGADVS LGLYPKE E L S T QS LE P VS LGD PNCK IDLSFLIDGSTSI GKRR FR I QKQLLADVAQALD I 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKI TQRGGLSNVGRAI SFVTKNFFSKANGN 
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINI FFIT IEGAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTGLNSAD I GFVIDGSS SVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLE FGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 
VDE FDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Ami da ti on site. 

amino acids 304-308 
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FIGURE as 



CCGAGCACAGGAGATT GC CT GCGT T T AG GAGGT GGCTGCGTT GTGGGAAAAGC T AT C AAGGAAG AAAT T GC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGAT GT CACT GAGAT C CC TCAAAT GG AG CCTCC T GC TGCTGT CAC TCC TG AG TT TCT TT GTGATGT GGT AC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
C AGACAAGACTT TC AC TT CACACT T CG AGAGC ATTC AAAC TG CTCT C ATC AAAATCC AT T TCT G GT CAT TC 
B : TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 

s:sr: 

m TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
f ■ 1 AT T G TC CT T AGAGG AT GAAC AC CT T C T T TAT G GT GAC AT AAT CC G ACAAG AT TTTTT AGACACATATAATA 

fl! ACCT GACCT T GAAAAC C ATT AT GGC ATT CAGGT G GGT AACTG AGTT TT GCCC C AAT GC CAAGT ACGTAAT G 

AAGACAGACACT GAT GTT TT CATC AAT ACTGGC AAT TT AGTGAAGTATCTTTTAAACCTAAACCACTCAGA 
GAAG TT TTTC AC AGGT TATCCT CT AATT GAT AAT T ATTCC T AT AGAGGATTT T ACC AAAAAACC CATAT TT 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT 
CTGT TT GAAT TT ATT AAAAGT GAAC AT T CAT ATT CC AG AAG ACACAAATCT T TT CTTT CT AT AT AGAAT CC 
5 55 ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT 

TGGCAGGTCATGCTAAGGAACACCACATGCCATTAT TAA CTTCACATTCTACAAAAAGCCTAGAAGGACAG 
f|| GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
|*| AC AC TGAACT GAAACT CATG AAAAACCC AG ACT GGAGACT GGAGGGTT AC ACTT GT GAT T TAT T AGTC AGG 

fl| CCCT TC AAAGAT GATATGTGGAGG AATT AAAT AT AAAGGAAT TGGAGGT TT T TGCT AAAGAAAT T AAT AGG 

ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CTAGGC TGTAAAAACAAAAC AATGTAGAGT TTT ATT T ATT GAAC AAT GTAGT CACT T GAAGGT T TT GT GTA 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT T ATT T AAAAT TACTTC AACT TTGTGT TT TT AAAT GTT T TGACGATT TCAAT AC AAGAT AAAAAGGAT AG 
TGAATC ATT C TT TAC ATGCAAACATTT T CC AGT T AC TT AACT GAT C AG TT T ATT AT TGAT AC AT CACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 



III 

SJ 

3 . 
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FIGURE 36 

MASALWTVLPSRMSLRSLKWSL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGD I IRQDFLDTYNNLTLKT IMA.FR WTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KP IKFEDVYVGI CLNLLKVNIHI PEDTNLFFLYR IHLDVCQLRRVI AAHGFS SKE 1 1 T FWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N- glycosylate on sites* 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
T CGTCCT T GC CAAGAGAGTAC ACAGTCAT TAATGAAGCC T GCCC TGGAGCAGAGT GGAATATC ATGTGTCGGGAGT GC TGTG 
AAT ATGAT CAGAT TGAGTGC GTC TGCC CC GGAAAGAGGGAAGTCGTGGGTTAT AC CATCCC T TGC T GC AGGAATGAG GAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACC T TGGATGACTTC TATGT GAAGGGGTTCTAC TGTGCAGAGTGCCGAGCAGGC T GGTACGGAGGAGAC TGC ATGCGATGTG 
GCCAGGTTCTGCGAGCCCCAAAGGGTCAGATTTTGTTGGAAAGCTATCCCCTAAATGCTCACTGTGAATGGACCATTCATGC 
T AAAC CTGGGT T T GTCATCC AAC TAAGAT TT GTCATGTTGAGTC TGGAGTTT GAC TACATGTGCC AGTATGAC TATGT TGAG 
G TTCGTGATGGAGAC AACCGCGATGGC CAGAT CATC AAGC GTGTC TGT GGCAAC GAGCGGC C AGC TCC T ATCC AGAGCATAG 
GATCC TC AC TC CACGTC C TC T TCC ACTCC GAT GGC T C CAAGAATT TT GAC GGT T T CCATGC CAT T TATGAGGAGATCAC AGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TGGGGAGC GC TGTGAAAATC TC C T TGAAGAAAGAAACTGC TC AGACC C T GGGGGCCCAGTCAATGGGT ACCAGAAAA 
T AAC AGGGGGC C C TGGGC T T ATC AACGGACGC CATGC TAAAAT TGGC ACCGT GGT GTCTTTCTTTTG TAAC AAC TC C TATGT 
TC T T AGT GGCAAT GAGAAAAGAAC T TGC CAGCAGAAT GGAGAGTGGT C AGGGAAACAGC C C ATC TGCATAAAAGCC TGCCGA 
GAAC C AAAGAT T T CAGACC TGGTGAGAAGGAGAGT T C T TC C GATGCAGGTTCAGT CAAGGGAGAC AC CATTACACC AGC TAT 
ACTCAGCGGCCTTCAGC^GCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACCAAC ATC T GCATACCC AGC TCCAGTAT GAGTGC ATC T CACCCT T C TAC C GC C GCC TGGGC AGC AGCAGGAGGACATGT 
C TGAGGAC TGGGAAG TGGAG TGGGCGGGC AC CATCC TGC AT CC C TAT C TGCGGGAAAAT TGAGAACATC AC TGCTCCAAAGA 
C CC AAGGGT TGC GCTGGCCG TGGCAGGC AGC CATC T ACAGGAGGACCAGC GGGGTGCATGACGGC AGC C TACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTC AC CAT GAT CAAGACAGC AGACC TGAAAGT TGTT TTGGGGAAATT C T ACC GGGATGATGACC GGGATGAGAAGAC CATCC 
' AGAGCCTACAGAT TTCT GC TATC AT TC TGCATCCC AACT ATGACC C CAT C C T GC T TGATGC TGACAT CGCCATC C TGAAGC T 
C CTAGACAAGGC C CGTATC AGCACC CGAGTC CAGC C CATC TGC C TC GC TGC CAGTC GGGAT C TC AGC AC TTC C TT C CAGGAG 
T CCC ACAT C AC T GTGGC TGGCTGGAAT GT CC TGGC AGACGT GAGGAGC CC TGGCT TCAAGAAC GACAC ACT GC GC TC TGGGG 
T GGTC AGTGTGGTGGAC TCGCTGC TGT GT GAGGAGC AGC ATGAGGACCATGG CAT CC CAGT GAGTGTC AC T GATAACATGT T 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
C GAGC ATC TCC TGAGCC ACGC TGGCAT CT GATGGGAC TGGTCAGC TGGAGC TATGAT AAAACATGCAGCCAC AGGCTCTCC A 
C TGCC TTCACCAAGG TGCTGCCT T T TAAAGAC TGGAT TGAAAGAAATATGAA ATGAA CC ATGC TCATGCAC TCC T TGAGAAG 
TGTT TCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCC TGTGAAC T TGGCT 
GTGC C AGGGCT TC TGAC TTC AGGGACAAAAC T CAGT GAAGGGT GAGTAGACC TCC AT TGC TGGTAGGC TGATGCC GCGTC CA 
C TAC TAGGACAGC CAAT TGGAAGATGCCAGGGC TTGC AAGAAGTAAG T TTC T T CAAAGAAGACCATAT ACAAAAC C T CTCCA 
C TC C AC TGACC TGGT GGTC T TCCCC AACT TT CAGT TATAC GAATGCC ATCAGC TT GACCAGGGAAGATC TGGGCT T C ATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
T GTGTACATGGC C AC AGTACAGTC TGGTCCTTTTCC T TC CCCATC TC T TGT AC ACAT TT TAAT AAAATAAGGGTTGGC TTCT 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLT FLQLLLI S SLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQ I LLE S YPLNAHCEWT IHAKPG FVT QLRFVMLS LE FDYMCQ YD YVEVRDGDNRDGQ 1 1 
KRVCGNERPAP I QS I GS SLHVLFHSDGSKNFDGFHAI YEE I TACS S SPC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KT QGLRWPWQAAI YRRT SGVHDGS LHKGAW FLVCS GALVNERTVVVAAHCVTDLGKVTM I KTADL 
KWLGKFYRDDDRDEKT I QSLQI SAI I LHPNYDP ILLDADIAI LKLLDKARI S TRVQP I CLAASR 
DLSTSFQESHITVAGWWLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGT T CCTACAT C CT CT CATCT GAGAAT CAGAGAGCAT AATCT T CTTAC GGGCC CGT GATTTAT TAAC GT GGCT TAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGA ATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTCCAGACGGCTGTGCGAGCC 
T CACAGCCAC GGCT CCCT CCCCAGAGGTTTCT G CAG C T GCCAC CAT CT C CTTAATGACAGAC GAGC CT G GCCT AGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCAC GGCCCTTT GAGAGAT CCACTATTAGAAGCAGAT CAT TTAAAAAAAT AAATCGAGC T TT GAGT GTT CTT CGAA 
GGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGAAAACACCACTGCCCCTGAAG 
T CTT TC CAAGGTT GTAC CACCT GATT C CAGAT G GT GAAATTAC CAGCAT C AAGAT CAAT CGAGTAGATCCCAGTGAAA 
p], GCCT CTCTATTAGGCT GGT GGGAGGTAGC GAAAC CCCACTGGT CCAT AT CAT TATCCAACACAT TTAT C GT GAT GGGG 

2 TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
jJJ ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 

1 1 GCAGGAACAAT GGACAGGC CCC GGAT GCCTAC AGAC C C C GAGAT GACAGCT TTCAT GT GATT CT CAACAAAAGT AGC C 

"■ 

ttj CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
; ; J GTGTGGCATATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCGATATGGCA 
&h GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCT GACAT CTT TCAGGAAGCCG GCT GGAACAGCAAT GGCAGCT GGT CC CCAGGG CCAG GGGAGAGGAGCAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCAT GACC GT CGCAGG GGGAGCAT CACATAGAGAAT GGGATTT GCCTAT CTAT GT CAT CAGT GT TGAGC CC GGAGGAG 
T CAT AAGCAGAGAT GGAAGAATAAAAACAGGTGACATT TT GTT GAATGT GGAT GGGGT CGAACT GACAGAG GT CAGCC 
GGAGT GAGGCAGT GGCAT T AT TGAAAAGAACATCAT CCT CGATAGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 
C CC AGGAAGACTG CAGCAGCC CAG CAGC CCT GGACT CCAAC CACAACAT GGCC CCACC CAGT GACT GGT C C CCAT CCT 
G GGT CATGT GGCTGGAAT TAC CACGGT GC TT GTATAACT GT AAAGATAT T GTAT T ACGAAGAAACACAGCT GGAAGTC 
T GGGCT TCTGCATT GTAGGAGGT TAT GAAGAATACAATGGAAACAAAC CT TTTT T CAT CAAAT CCATT GTT GAAGGAA 
CACCAGCATACAAT GAT GGAAGAATTAGAT GT G GT GATAT T CTT CT TGCT GTCAAT GGTAGAAGTACAT CAGGAAT GA 
TACAT G CTTG CTT GGCAAGACTGCT GAAAGAACT TAAAGGAAGAAT TACT CTAACT ATT GT TT CTT GGC CT GGCACTT 
TTT TATAGAAT CAAT GAT GGGTCAGAGGAAAACAGAAAAAT CACAAATAGGCT AAGAAGTT GAAACACTATAT TTATC 
TTGT CAGT TTT T ATAT TT AAAGAAAGAAT ACATT GTAAAAATGTCAGGAAAAGTAT GAT CAT CTAAT GAAAGCCAGTT 
ACACCT CAGAAAAT AT GAT T CCAAAAAAATT AAAACTACTAGTT TTTTTT CAGT GTGGAGGAT TT CTCATTACT CTAC 
AACATT GT T T ATAT TT TTT C TAT T CAATAAAAAGCCCTAAAACAACTAAAATGAT T GAT TT GTATACC C CACT GAATT 
CAAGCT GATTTAAATT T AAAATTT GGTATAT GCTGAAGT CT GCCAAGGGT ACAT TATGGC CATT TT TAAT TTACAGCT 
AAAAT ATTT TT TAAAATGCAT TGCT GAGAAAC GTT GCTT T CAT CAAACAAGAATAAATATTTT T CAGAAGTTAAA 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDE PGLDNPAYVS SAEDGQPAI S PVDSGRSNRTRARP FERS T IRS RS FKKI NR 
AL S VLRRT K S G S AVANHAD Q GRE N S ENT T APE VFPRL YHL I P DGE I T S I K I NRVDP S E S L S I RL V 
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
I KTGD I LLNVDGVELTEVSRSEAVALLKRT S S S I VLKALE VKE YE PQEDCS S PAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
C TTGC CTGCAGT GAAGCAGAGAGATAGAT AT TATTCACGTAATAAAAAACATGGGCT TCAACC TGAC T 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
- . ATAAGACCCTCATT T TGGGGAAGGGAAAAACT CTGAC TAAT GAAGCATCCACGAAGAAGGTAGAACTT ■ 

GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
H CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
C TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
III ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
~f\ CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
%j GGGT GTTACTGC CCTAAGCAGAGAGCAGT TT TTCAAGGT GAAT GGAT TCTCTAACAACTACTGGGGAT 

|,l GGGGAGGC GAAGACGAT GACC TCAGAC T CAGGGTTGAGCT CCAAAGAAT GAAAAT TTCCC GGCCC CTG 

III CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
W GATGAAGCTCTTACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTTGTTCTTATAAAT 
W T AGTATCT GTGGAACACAATCCT TTATAT AT CAACAT CACAGT GGAT TT CT GGT T T GGTGCATGACCC 

r, jj^ ____ 

TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
Jp H ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
F| TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
f!f CATGAGGGTTAAATATTGTAATATGGATACTTGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 

AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGGGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGTTTTT GTAAAATGATTT TGTACAAGT AGGATAT GAAT TAGCAGTTTACAAGTTTACAT 
ATTAACTAATAATAAATAT GTCTATCAAATAC C TCT GTAGTAAAA.TG TGAAAAAGCAAAA 
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FIGURE 42 

MGFNLT FHLS YKFRLLLLLTLCLTVVGWATSNYFVGAI QE I PKAKE FMANFHKTLILGKGKTLTN 
EAS TKKVELDNCPSVS PYLRGQSKLI FKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAI LVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYYIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNPLY INI TVD FW FGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAAC T TGCAGAGC TGCAAC C CCAGGACAGAGCTGGAGCCAGGGCCAGC TGGATGCC CAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

WIT 
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FIGURE AA 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CI FCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 




N-myristoylation site. 



amino acids 44-50 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 1-12 
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FIGURE A* 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATGG CTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TT CAACACAAC CCCT GT T GT CACCATACAGCCAGAAGGGGGCACTATCATAG TGACCCAAA 
AT CGTAAT AGGGAGAGAGTAGAC T TC CCAGATGGAGGC TAC T CCC T GAAGCT CAGCAAACT GAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTGTACGAGCACCTGTCAAAGCCTAAAGTCAGCATGGGTCTGCAGAGCA 
ATAAGAAT GGCACCTG TGT GACCAAT C TGACAT GC TGCAT GGAACATGGGGAAGAGGAT GTGAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
! AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGGCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TC TAGA CAGCAGTGCAC TCCCCTAAGTCTC TGC T CA 
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FIGURE A6 

MAGS PTCLTL I YI LWQLTGSAASGPVKELVGSVGGAVT FPLKSKVKQVDS I VWTFNTTPLVT IQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVI YTWKALGQAANESHNGS I LPI SWRWGESDMT FIC 
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD I CRET PN I CPHS GENTE YDT I PHTNRT I LKE DPANTVYS TVE I PKKMENPHS LLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain : 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



o 



m 

o 

III 
CI 

m 



GGC T C GAG C GT T T C T GAGC C AGGGG T GAC C ATGA C C T G CT G C GAAGGAT GG AC AT C C T G CAAT GG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCAT TCCAGCAACAACAATGT CCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAAC T GGAATGT TTC T TTCATCATT TTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AAC AGT AAT G C CAAT T G T G AAT T T T CAT T G AAAAAC AT C AG T GAC AT T CAT C C AG AAT C C T T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAGT TTAAT GGGAATAAAATGTAAGT AT CAG TAG T T T GAAAAAAAAAAA 
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FIGURE 48 

MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRAGCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL 
KN I SDI HPE S FNLQWFFNDS CAP PTGFNKPT SNDTMAS GWRAS S FHFDS EENKHRL I HFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLGGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-34, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGT GAGGAACC T GCAGAGTCACACAGT TGCT GACCAAT TGAGC T GT GAGCCT GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
S QKQHS PVPEKAI PL I T PGS AT TC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 3 6-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 



GT GGAC T C T GAGAAGC CCAGGCAGT TGAGGACAGGAGAGAGAAGGC TGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
^ TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
O GTGCCTGGCGACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAAC T TTGGGACCAACAC TCAGGGAGC TG TGGC CCAGCC T GGC TAT GG TTCAGT GAG 
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
II! ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
w AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
7^ TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
f*\ CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
{|| CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
Q CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
Rl GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T T AAAT CCAAGC T GGG T T TCAT CAACT GGGAT GCC ATAAACAAGGAC CAGAGAAGC T C TC 
GCATCCCGTC^CCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAftAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



pi 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHA.LGNT GHE I GRQAEDVI RHGADAVRGS WQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNiyfREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 
amino acids 1-21 

N-glycosylation site. 

amino acids 265-269 

Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 
Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298 r 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCGCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGT C C T AT ATAACGAT C T T CAACAAGAG T GC AAAC AT CAT GC TTGACAAGTGGCAGC AC CT GG 
Q CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 

CAGAAATGCATCT TCAGCT TTGACAGCCAT T GT CAGGAGAGGCC CAGT GAATATAT TGCCAC CAT 
Zl CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
%! ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
N GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
111 AGCCAAGT CCAAGAC T T T GGAT T T CAT T GATGTGC TTCTGC TGAGCAAGGAT GAAGATGGGAAGG 

* H CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
S GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
p ACAGGAGGT GCAAGAGCT T C TGAAGGACCGCGAT CC TAAAGAGAT TGAATGGGACGACC TGGC CC 

gl AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
W TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 

MS LLSLPWLGLRPVAMS PWLLLLL WGS WLLAR I LAWT YAFYNNCRRLQC FPQPPKRNWFWGHLG 
L I TPTEEGLKDS TQMSATYSQGFTVWLGPI I PFI VLCHPDT IRS I TNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDT T AS GL S W VL YNLARHPE YQERCRQE VQE LLKDRDPKE I E WDDLAQL P FL TMC VKE SLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
LAF I P FSAGPRNC I GQAFAMAEMKWLALMLLHFRFLPDHT E PRRKLEL IMRAEGGLWLRVE PLN 
VGLQ 

Important features : 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 



AT CGCATCAAT TGGGAGTACCAT CT TCCT GATGGGACCAGT GAAACAGC TGAAGCGAAT GT T T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTT TTATGAAGCT TT GGAAGGCACTATGGACAGAAGCTGGTGGACAGTT TT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
h&. T C T G GAT G T T G T C C CAC T GAAT T C C CAT GAAT AC AAAC C TAT T C AGC AAC AGC AAAAAAAAAAAA 

W AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

W 

m 
w 

SI 
m 
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FIGURE *6 

MGP VKQLKRMFE PTRL I AT IMVLLC FALTLCS AFWWHNKGLAL I FC I LQS LALT WYSLS FI P FAR 
DAVKKC FAVC LA 

Important features : 
Signal peptide: 

amino acids 1-33 

14 Type II fibronectin collagen-binding domain protein. 

h-4 amino acids 30-72 

© ' 
iy 
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CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTTCAAGCTCAGTGTCTTCATCCCCTCCCAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCT 
GGAGATAAG GAC CT T GATGGGCAGCT AGACTT T GAAGAATTT GT CCATTAT CT CC AAGAT CAT GAGAAGAAGCT GAGG 
CT GGT GTTTAAGAT TTT GGACAAAAAGAAT GAT G GACG C ATT GACGCGCAGGAGATCAT G CAGT C CCT GC GGGACTT G 
GGAGT CAAGATAT CT GAACAGC AG GCAGAAAAAAT TCT CAAGAGCAT GGATAAAAAC GGCACGAT GAC CATC GAC TGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
T CACT CT G GC GGGGCAAT GG CATCAAC GT C CTCAAAAT T GCCC C CGAAT CAGCCAT CAAATT CAT GG CCTAT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT CG CCCAGAGCAGCAT CT ACC CAAT GGAG GT C CT GAAGACCC GGAT GGC GCT G C GGAAGACAGGCCAGTACT CAGGA 
ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
O GGC AT CAT CC CCT AT GCCG GCAT C GAC CTT GC AGT CTACGAGACGCT CAAGAAT GCCT GGCT GCAGCACTATGCAGT G 

pig AACAGCG CGGAC C CC GGCGT GTT T GTGCTC CT GGC CT GTG GCACCAT GT C CAGTACCT GT GGCCAGCT GGCCAGCTAC 

CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCT GT GAGCATCAGC TAC GT GGT CT ACGAGAACCT GAAGAT CACCCT GGGCGT GCAGT CGCGGTGAC GGGGGGAGGG C 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
• JrJ GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
t| CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ff$ AC T GTTGG GAAAAGGGT TT T GT CCAGAAGGACAAGCCG GACAAAT GAGC GACTTCTGT GCTT C CAGAGGAAGACGAG G 

2 GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
1*1 GGGACCAGCCCCACATT CCACTTGT GT CACT GCTT GGAACCTATTTATTTT GTATT TATT T GAACAGAGT TAT GT CCT 

fill AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTT 
GATT GTAC CTT C C CAAGCCC GCCCAGT GGGAT GGGAGGAGGAGGAGAAGG GGGGC CTT GGGC C GCT GCAGT CACAT CT 
GT CCAGAGAAAT T CCT TTT GGGAC T GGAGG CAGAAAAG C GG C CAGAAGGCAGC AGCCCT GGCT CCT TT CCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCT GCT GC C CTT G CTTAACAAT GCC GGC CAACT GG C GAC C T CACGGTT GCACT T CCATT C CAC CAGAAT GAC CT GA 
TGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATTGTTATATATGAACATATAACTGGAGTCGTCAAAAAG 
CAAAT TAAGAAAGAAT T GGACGTT AGAAGTT GT CATT TAAAGCAGC CTTCTAATAAAGTT GTT T CAAAGC T GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKS I FKLSVFI PSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFKI LDKKNDGR I DAQE IMQS LRDLGVKI S EQQAEKI LKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHS T I FDVGENLTVPDE FTVEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVIjMQVHAS RSNNMG I VGGFTQMI REGGARSLWRGNG INVLK I APES AI K 
FMAYEQ IKRLVGSDQET LRI HERLVAGS LAGAI AQS S I YPME VLKTRMALRKTGQYS GMLDCARR 
I LAREGVAAFYKGYVPNMLGI I PYAGI DLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMS STC 
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVI PAVS I SYV 
N» VYENLKITLGVQSR 

m 

pj Important features: 

lal Signal peptide: 

%! amino acids 1-16 

I If! 

Putative transmembrane domains: 

p amino acids 284-304, 339-360, 376-394 

yi 

C| Mitochondrial energy transfer proteins signature. 

Ill amino acids 206-215, 300-309 

fl 

ril 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 5Q 



GGAAGGCAGCGGCAGCT C CAC T CAGCCAGTACCCAGATACGC T GGGAAC C T TC CCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GG GAAC AT T GGGGAG GAT G GAAT C C T GAG C T GCAC T T T T GAAC C T GAC AT C AAAC T T T C T GAT AT 
CGTGAT ACAAT GGC TGAAGGAAGGTG T T T TAGGC T T GGT CCAT GAG T TCAAAGAAGGCAAAGAT G 
AGC TGT C GGAGCAGGAT GAAAT GT TCAGAGGCCGGACAGCAG T GT T T GC T GAT CAAG TGATAGT T 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAA.GGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
jbij CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
N TCAACAACACATAC TCC T GTATGAT TGAAAATGACAT TGCCAAAGCAACAGGGGATATCAAAG T G 

h" ACAGAATCGGAGATCA?WVGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 

? ;r~ 

CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
p| GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
HI CACCAGATAT GACC TAG T T TTATAT TT CT GGGAGGAAAT GAAT TCATAT C T AGAAGTCT GGAG TG 

CI AGCAAACAAGAGCAAGAAACAA?WVGAAGCCAAAA.GCAGAAGGC T CCAATATGAACAAGATAAAT 

ftp 

j| CTAT CT T CAAAGACATAT TAGAAGT TGGGAAAATAAT T CAT G TGAAC TAGACAAGTG T G T TAAGA 

5| GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAAT TT TAGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 



ill 

W 

II! 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 

DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 

YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 

NTSFELNSEWTMKWSVLYNVTINNTYSCMIE 

CVS S FFAI SWALL PLS PYLMLK 

Important features : 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 



n amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 

|f| 220-224 

CI 

PI N-myristoylation sites. 

|f| amino acids 52-58, 126-132, 188-194 

rii 
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FIGURE 61 



TGACGT CAGAATCAC CATGGCCAGC TAT CCT TACCGGCAGGGC TGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
AT T GGT CT T CAT TCAAT GAT GAGAC C T GCC T CAT GAT GAT AAACAT GT T T GACAAGACC AAGT CA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCT ATGA CCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
L TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
m ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
P| AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
III ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
M AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATeGGCCTGTTACC 

m 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
C TGACC AAGC ATGAGAGAGAT C T GT C TAT GGGACC AG T GGC T TGGAT TC T GCC ACACCCAT AAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 



III 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCI^INMFDKTKSGRIDVYGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLSFEDFVTMTASRML 



Important features of the protein: 
m Signal peptide: 

U amino acids 1-19 



! a 2 



HI 



N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 



Q N-myristoylation sites. 



amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

M TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 

f 1 

CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
Sj ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
N 5 CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
** s GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
m CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
Iff GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGGCCCTTGCTG 
CJ TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCGTCCCC 

T GC TCAT C TCCAATAAAAT AAAAGCAC T TAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAATiAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



01 



f:;1 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDIJUWPLKLSVPPSDGFPPAGGSAVQRWPPSW 

LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of* the protein: 
Signal peptide: 

jUl amino acids 1-26 

hi Casein kinase II phosphorylation sites. 

Si amino acids 56-59, 155-158 

m ' 

N-myristoylation sites. 

p.. amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

|f| 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 

■Ml • 

Ml 
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FIGURE 6s 



m 

1-3 

w 



01 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACGCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 



MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myr i s toyl a ti on s i te s . 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
to GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
3 GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 

jjf GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 

II 

AGACTC TGCCCC TGAGGT CAAGAGAGGAT GGGGCTAT T CAC T T T TAT AT AT T TATATAAAAT TAG 
,J TAGTGAGATGTAAAAAAAAAAAAAAAAAA 



g US 

ru 
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FIGURE 68 

MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites . 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAAT AACT AGAGAGGAACA ATG GGGTTAT T CAGAGGT TTT GTTT T CCT CT TAGT T CT GT GCCT GCTGCAC CAG 
TCAAATACTT CCT T CATT AAG CT GAAT AAT AAT GGCT T T GAAGATATT GT CATT GTTAT AGAT CCT AGT GT GCCAGAA 
GAT GAAAAAATAAT T GAACAAAT AGAGGAT AT G GT GACTACAGCTT CT ACGT AC CT GTTT GAAG C CACAGAAAAAAGA 
TTTT TTT T CAAAAAT GT AT CTAT ATTAAT T CCT GAGAATT GGAAGGAAAATC CT CAGTACAAAAGGCCAAAACAT GAA 
AAC CATAAACAT GCT GAT GT TAT AGTT G CACCACCTACACTCCCAGGTAGAGAT GAAC CATACAC CAAGCAGTTCACA 
GAAT GTGGAGAGAAAGG C GAAT ACATT CACTT CACC CCT GACCT T CTAC T TG GAAAAAAACAAAAT GAAT AT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC' 
TACCGTGCT AAGT CAAAAAAAAT C GAAGCAACAAGGT GTT CCGCAGGTAT CT CT G GT AGAAATAGAGT TT ATAAGT GT 
CAAGGAGGCAGCT GT CTT AGTAGAGCAT GCAGAAT TGATT CT ACAACAAAACT GT AT G GAAAAGATT GT CAATT CTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT CCATAAT GT T TAT GCAAAGTAT T GATT CT GTT GTT GAATT TT GTAAC GAA 
AAAAC CCATAAT CAAGAAGCT CCAAGC C TACAAAACAT AAAGT GCAAT TTTAGAAGTACAT GGGAGGT GATTAGCAAT 
TCT GAGGATT TT AAAAACAC CATACCCAT GGTGACACC ACCT C CT CCACCT GT CT T CT CAT T GCT GAAGAT CAGT GAA. 
AGAAT T GT GT GCTT AGT T CT T GAT AAGT CT GGAAG CAT GGGGGGTAAGGACC GCCTAAAT C GAAT GAAT CAAGCAGCA 
AAACATT T CCT G CT GCAGACT GT T GAAAAT GGAT C CT GGGT GGGGAT GGT TCACT TT GATAGTACT GCCACTATT GT A 
AAT AAGCTAAT C CAAAT AAAAAGCAGT GAT GAAAGAAACACAC T C AT GGCAGGAT T ACCT ACATAT C CTCT GG GAGGA 
ACT T CCAT CT GCT CT GGAAT T AAAT AT GCATTT CAGGT GATT GGAGAGCTAC ATT C C CAAC T C GAT GGAT C C GAAGTA 
CT GCT GCT GACT GAT GGGGAGGATAACACT GCAAGTTCTT GTATT GAT GAAGT GAAACAAAGT GGGGCCATT GTT CAT 
IJI TTT AT T GCT T T GGGAAGAGCT GCTGAT GAAGCAGT AATAGAGAT GAGCAAGAT AAC AGGAGGAAGTCATT TTTAT GT T 

5 !: ii TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
Jy* TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
%l GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
AT AAT G GAAAATTT CAC AGT G GATGCAACT TC CAAAAT GGC CT AT CT CAGTATTCCAGGAACT GCAAAGGT GGGCACT 
j-jj TGGGCATACAAT CTT CAAGC CAAAGC GAAC CCAGAAACATTAACTAT T ACAGTAACT T CT CGAGCAG CAAATT CTT C T 

|) | GT GCCT CC AAT C ACAGT GAAT GCT AAAATGAATAAG GACGTAAACAGT TT CC CCAGCC CAAT GATT GT TTAC GCAGAA 

ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
_ TT GGAACTTTTGGATAAT GGT GCAGGCG CT GAT TCTT T CAAGAAT GAT GGAGT CT ACT CCAG GT AT TT T ACAGCATAT 

CI ACAGAAAAT GGCAGAT ATAGC TT AAAAGTT CG GGCT CATG GAGGAGC AAACACT GC CAGGCTAAAAT T ACG GC CT C CA 

||| CTGAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATTGAT 
GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
CTT CC CT T GCCT GACCAATACCCACCAAGT CAAAT CACAGACCTTGAT GC CACAGT T CAT GAGGATAAGAT TATT CT T 
p| ACAT GGACAGCAC CAGGAGAT AATTTT GAT GTT GGAAAAGT T CAAC GT TAT AT CATAAGAATAAGT GCAAGTATT CTT 

2f GAT CTAAGAGACAGTT T T GAT GATGCT CTT CAAGTAAAT ACT ACT GAT CT GT CAC C AAAGGAGGC CAACT C CAAGGAA 

y AGCTT TGCAT T TAAAC CAGAAAATAT CT CAGAAGAAAAT GCAACC CACATAT T TAT T G C CAT TAAAAGTAT AGATAAA 

pjj AGCAATTTGACATCAAAAGTATCCAACATTGCACAAGTAACTTTGTTTATCCCTCAAGCAAATCCTGATGACATTGAT 
rW CCTACACCTACTCCTACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTATTG 
TCT GT GATTGGGT CT GT T GTAATTGTTAACTT TATT TT AAGTACCAC CAT T TGAA CCTTAAC GAAGAAAAAAATCTT C 
AAGTAGAC CTAGAAGAGAGT TTTAAAAAACAAAACAAT GTAAGTAAAGGATAT T T C T GAAT CT TAAAATT CAT CCCAT 
GT GTGAT CAT AAACT CAT AAAAAT AATT TT AAGAT GT C GGAAAAGGATACT TT GAT TAAATAAAAACACT CAT GGATA 
TGT AAAAACTGT C AAGAT TAAAATTTAATAGTTT CAT TTATT T GTTAT TT TATTT GTAAGAAATAGT GAT GAACAAAG 
AT C CT TT TT CATACT GAT ACCT GGTT GTATATTATTT GAT GCAACAGT TT T CT GAAAT GAT AT TT CAAATT G CAT CAA 
GAAAT TAAAATCATCTAT CT GAGTAGTCAAAAT ACAAGTAAAGGAGAGCAAATAAACAACAT T T GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



M 
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FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IH FT PDLLLGKKQNE YGPPGKL FVHEWAHLRWGVFDE YNEDQP FYRAKSKKI EATRC SAG I S GRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFKQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLiytAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVI IDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANS S VP P I TVNAKMNKDVNS FPSPMI VYAE I LQG YVP VLGANVTAF IES QNGHTE VLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLPCVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
%l RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS I LDLRDSFDDALQWTTDLS PKEANSKE S FAFKPENI SEENATHI FIAI 
ill KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
5 ?v VNFILSTTI 

5 . 

Ml 

f| Signal peptide: 

|0. amino acids 1-21 



IS « 

s 



Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 



N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGT GGAAACC CT G GGAGTAGAGT ACTGACAGC AAAGAC CG GGAAAGACCAT AC GT C CC CGGGCAG GGGTGA 
CAACAGGT GT CAT CTTTTT GAT CT CGT GT GT GG CT GC CTT CCTATTT CAAGGAAAGACGC CAAGGTAATT TT GAC C CA 
GAGGAGCAAT GAT GT AGCC ACCT C CTAAC CTT C C CTT CTT GAAC C CC CAGTTAT GC CAGGATT T ACTAGAGAGTGT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCT GT GAT GGGAAT GAAGGTGAAAACTT GGAGAT TT CACTT C AGTCATT GCT T CT GC CT GCAAGAT CAT C CTTT AAA 
AGTAGAGAAGCT GCTCT GTGT GGTGGT TAACT CC AAGAGG C AGAACT C GT TC T AGAAGGAAAT GGAT G CAAG CAG CT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CG CAACTACGTGAGCAGC CT GAAGCG GCAGAT CGCACAGCT CAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGG 
g : AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC 
CT GGC CT T CCTGCACTC GC AGGT GGACAAGGCAGAGGT GAAT GCT GGCGT CAAGCTGGCCACAGAGTATGCAGCAGT G 
Q CCT TT CGATAGCT T T ACT CTACAGAAGGT GTAC CAGCTGGAGACT GGCCT TAC CCGCCAC C C CGAG GAGAAGC CT GT G 

AGGAAGGACAAGC GGGAT GAGT T GGT GGAAGCCAT T GAAT CAGCCT T GGAGAC CCT GAACAAT CCT GCAGAGAAC AGC 
CC CAAT CACC GT C CTTACACGGC CT CT GAT T T CATAGAAGG GAT CTACCGAACAGAAAGG GACAAAGGGAC ATTGT AT 
|f I GAGCT CAC CTT CAAAGGGGAC CACAAACAC GAATT CAAACG GCTCATCTT ATT TCGAC CAT T CAGCC CCAT CAT GAAA 

| a « GT GAAAAAT GAAAAGCT CAAC AT GGC CAACAC G CTTAT CAAT GTTAT C GT GC CT CTAGCAAAAAGGGT GGACAAGTT C 

' CGGCAGTTCAT GCAGAATTT CAG GGAGATGT GCATT GAGCAGGAT GGGAGAGT CCAT CTC ACT GT T GTTTACTTT GGG 

%l AAAGAAGAAAT AAAT GAAGT CAAAGGAATACTT GAAAACACT T CCAAAGCTGCCAACTTCAGGAACTT TACCTTCAT C 

.5^ CAGCTGAAT GGAGAATT T T CT CG GGGAAAGGGACTT GATGT T GGAGCC CGCTTCT GGAAGGGAAGCAACGT CCTT CT C 

fl„ TTT TT CT GT GAT GT GGACAT CT ACTTCACAT CT GAAT T CCT CAATAC GT GT AGGCT GAATACACAGC CAGGGAAGAAG 

111 . GT AT T TTATCCAGTT C T TTT CAGTC AGTACAAT CCT GGC AT AATAT ACGGC CACCAT GAT GCAGT CC CTCC CTTGGAA 

CAG CAGCT GGTCATAAAGAAGGAAACT GGATTT TGGAGAGACTTT GGATTT GGGAT GACGT GT CAGTAT CGGT CAGAC 
* st . TT CAT CAATAT AGGT GGGTT TGATCT G GACAT CAAAGGCT GGG GC GGAGAGGAT GT GCAC CTT TAT C GCAAGT AT CTC 

O CACAGCAACCTCATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAG 
| |f CTGACC CCCGAGCAGTACAAGAT GTGCATGCAGT CCAAGGC CATGAACGAG GCAT CCCAC GGCCAG CT GGGCATG CT G 

GTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAACTCCCAGA 
C| GAAGGATT GT GGGAGACACT TTT TCTT TCCT T T TGC AATTACT GAAAGTG GCT GCAACAGAGAAAAGACT T CCATAAA 

p! GGACGACAAAAGAATT GGACT GAT GG GTCAGAGAT GAGAAAGC CT C C GATT T CTCT CT GTT GGGCTT TTT ACAACAGA 

J! AAT CAAAAT CT CC GCTTT GCCT GCAAAAGT AACCCAGT TGCACCCT GT GAAGT GT CT GAC AAAGG CAGAAT GCTT GT G 

O AGATTATAAGCCT AAT GGT GTGGAGGTTTT GAT GGT GTTT ACAATACACT GAGAC CT GTT GT TTT GT GTG CTCAT T GA 

fn ■ AATAT T CATGATTTAAGAGCAGTTTT GTAAAAAATT CATTAG CAT GAAAGGCAAGC AT ATT T CT C CT CATAT GAAT GA 

* • ■ GCCTAT CAGCAGGGCT CTAGTT T CTAGGAAT GCT AAAATAT CAGAAGGCAGGAGAGGAGATAGGCT TATTAT GAT ACT 

AGT GAGTACAT TAAGT AAAATAAAAT GGAC CAGAAAAGAAAAGAAAC CATAAATAT C GT GT CATAT TTTCC CCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TT T TT T CCCTT GT GAGTTAT AGT CT GCTTAT TT AATTACCAC T TT GCAAGCCTTACAAGAGAGCACAAGT T GGCCTAC 
ATT TT TAT ATT TTTTAAGAAGATACT T T GAGAT GCATT AT GAGAACTT TCAGTT CAAAGCATCAAATT GAT GCCATAT 
CCAAGGACAT GCCAAAT GCT GATTCT GT CAGGCACT GAAT GT CAGGC ATT GAGACATAGGGAAGGAAT GGTTT GTACT 
AATACAGACGTACAGATACTTT CT CT GAAGAGTAT TT T CGAAGAGGAGCAACT GAACACT GGAGGAAAAGAAAAT GAC 
ACT TT CT GCTT TACAGAAAAGGAAACT C ATT CAGACT GGT GAT AT CGT GATGT ACCTAAAAGT CAGAAAC CACATT T T 
CT CCT CAGAAGT AGGGACCG CT TT C T TAC C TGT TT AAATAAAC CAAAGTAT ACC GT GT GAACCAAACAAT CT CTTT T C 
AAAACAGGGT GCT CCTC CT G GCTTCT GGCT T C CAT AAGAAGAAAT GGAGAAAAATATAT ATAT ATATATATATATT GT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATTATTTTT TAAATTAAGCAGTT CTACT CAAT CACCAAGAT GCTT CT GAAAAT T GCATTTTATTACCAT T T 
CAAACTATTT TTTAAAAATAAATAC AGTTAACATAGAGTGGT T T CTTCATT CAT GT GAAAATTATTAGCCAGCAC CAG 
AT GCAT GAGCTAATT AT CTCT TT GAGT CCT T GCTT CT GTTT GCTCACAGTAAACT CATT GTTTAAAAGCT T CAAGAAC 
ATT CAAGCT GTT GGT GT GTTAAAAAAT G CATT GTATT GATTT GTACT G GTAGT TT AT GAAATTTAATTAAAACACAGG 
CCATGAAT GGAAGGT G GTATT GCACAGCTAAT AAAATAT GAT TT GT GGAT ATGAA 
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FIGURE 72 

MMMTORGLLAWISRVVVLLV^ 

EQHRNWSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATE YAAVPFDS FTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIE SALETLNNPA 
ENS PNHRP YTAS DFIEGI YRTERDKG T LYELT FKGDHKHE FKRL I L FRP FS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL 
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCATGATAC CCTAC TGAACACCGAAT CC CC TGGAAGCC CACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGC T TATGAACACAT TC TGAGTCACT T GCAT GAAGTCAGGCATAAAGATCAGAAGACC T C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T T CAAT GGAACAGCT GGAAAAGC T TCAGGGGACATTGT TCT CCACAGAAGAGGAGCCC T C TAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGT CT TCACCTCAGCACAAGCCACGAC T GAGGCATAAAT T CC T TCT CAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACCCAGGCT GTCAT TCCAGGAAGAACT GCAGAGCC T TCAGCC TC T CCAAACAT GTAGGAGGAA 
ATGAGGAAAT CGCTG TGT TGT T AAT GCAGAGANCAAAC TC TGT T TAG T T GCAGGGGAAGT T TGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CT CCT T AGGATAAAGAGT TGC T GTT GAAGT TG TATAT T T T T GAT CAATATAT T TGGAAAT TAAAG 
TTTCTGACTTT 



APP_ID=10063715 



Page 227 



FIGURE 7 A 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE zs 



o 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
AT GGAC T CC AC CAGAGGGTCTT CAAGGAG T TAAAG T TAC T TACAC TG TGCAGTAT T TCAT CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAAT CCAGAAGACCTT CC TGTT TC CATGCAACAAATAT AC TCCAAT CTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACAT TTAGGGTATGCTTCGCAT TTGATGGAAAT TT TTTGTGAC TCTGAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGT CAT T GAATAT GAAT ATGAT GT C AGAACCAC T GACATT T GTGCGGGGCCT GAAGAGCAGGA 
GCT CAGT TT GCAGGAGGAGGTGTCCACACAAGGAACAT TAT T GGAGT CGCAGGCAGCGT TGGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAAT TCAT G GAG GAAT GGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
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FIGURE 76 



m 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLECYOTSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FWYVLP I S I TVFLFSVMGYS I YRYI HVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 



leaf 

fs% Signal peptide: 

k| amino acids 1-28 



Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGT GCCCAGGTC TGGCATCCT GCACTT GCTGCCC TC TGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
h& TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
O CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
** TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
! B 'a CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
\| CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
M TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
jsr| TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
y| ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
Q T GAT CAAT GAAAAGGC TGCAGATAAGCTGGGAT CTAC CCAGAT CGTGAAGAT CC TAAC TCAGGAC 

CIS ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
2* TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 

TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATGCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



mGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGG I PVLGSLVNT VLKH 1 1 WLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL 
VKNQLCPVI E AS FNGMYADLLQLVKVP I SLS I DRLE FDLL YPAI KGDT I QLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
H FEAAESSLTKDALVLTPASLWKPSSPVSQ 

Important features of the protein: 
III Signal peptide: 

Si amino acids 1-21 



If! 



ru 



N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAGTCAGCC TGGCAGAGAGACTCT GAAATGAGGGATTAGAGGTGT T CAAGGAGCAAGAGC ' 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCT C TGGATGGAATGTGCCACACACAGCACAGGCAT CACCCAGT GT GACAT CT AT AGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 
GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 
GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 
GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 
TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCGATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T T GTGCAT GAAC T GAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGATGGGAGGACAGGAAGGCAGCCTGGGACAT TTAAAAAAATA 
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FIGURE 80 



MAS LGLQLVGY I LGLLGLLGTLVAMLLPS WKT S S YVGAS I VTAVGFS KGLWMECATHS TGI TQCD 
I Y S TLLGLPAD I QAAQAMMVTS SAI S S LAC 1 1 S WGMRCT VFCQE SRAKDRVAVAGGVFF I LGGL 
LG F I PVAWNLHG I LRDFYS PLVPDSMKFE I GEAL YLG 1 1 S S L FS L I AG 1 1 LC FS CS S QRNRSNY Y 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 
amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins, 

amino acids 46-59 
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FIGURE 81 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCGAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGT C T T CC TAGCCCAGACC CT GGAGC TGAAGGT CACGGCCAGT CCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAA CAGT T GCAGATATGAGCTGT ATAATT GT T GT TAT TAT AT AT TAATAAA 
TAAGAAGTTGCATTACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEWVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGT T CT CT AGCG TGGT CC ACCT CT AC AGGG ACGGGAAGGACC AGCC AT TT AT GC AG AT GC CAC AGT AT C 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGT GT CAGC ACT GGG CT C AGT TC CTCT C ATTT C CAT C ACGGGAT ATGT T G AT AGAG ACATCC AGC TAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AG AC TCC AGG AC AAAC AG AGAC AT GC AT GGCCTGTT T GAT GT GG AG AT CT CT CT GACCGT CC AAGAGAA 
CGCCGGGAGC AT AT CCTGTT CCAT GCGGC ATGCT C ATCT G AGCCGAGAGGTGGAATCCAGGGT ACAGAT AG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
G AGAAG AAAGC ACGGAC AGGCAGAAT TGAGAG ACGC CCGGAAAC ACGC AGTGGAGGT G ACTCT GGATC C AG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTC AC TC TG AGAAGAGATT TAC AAG GAAG AGT GT G GT GGCT TC T CAG AGT TTC CAAGC AGGGAAAC AT TA 
CT GGGAGGTG GACGGAGG AC AC AAT AAAAGGT GGCGCGT GGGAGT GTGCC GGGATGAT GT GG AC AGGAGG A 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT TATT GAGGCC CT AC AT TGAGT ATC CGT CCT AT AATG AGCAAAAT GGAACT CCCAT AGTCATC TGC 
CC AGTC ACCC AGGAAT C AGAGAAAGAGGC C TCT T GGC AAAGGGCCTCT GC AAT C CC AGAGAC AAGC AAC AG 
T GAGTCCT CC TC AC AGGCAACC ACGC CCT T CCT CCCCAGGGG TGAAAT GTAGG ATGAATC AC AT CC C ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GAC ATT AC AT TT AGTT T GCTCTCACTCC ATCT GGCT AAGTGATCTTGAAATACC ACCT CTCAGGTG AAG 
AACCGT C AGG AATTCC CATCTC AC AGGCTGT GGT GT AGAT T AAGTAGAC AAGGAAT GTGAAT AATGCT TAG 
AT CT T AT T GATGAC AG AGT GT ATC CT AATGGT T T GTTCAT T AT ATT AC AC TT TC AGT AAAAAAA 
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FIGURE 8d 

MAMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
P I V I C P VT QE S EKE AS WQRAS AI PETSNSESSS QAT T P FLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 85 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 
TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACCGGCTCTCTGTGAA 
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGAGCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTT GGGAC TCGCGGGGACAGGAGGCCAC TGACACCGAGTACT CGGAGAT CAAGATCCACAG 
ATGAGAAAC TGCAGAGAC TCAC CCT GAT T GAGGGATCACAGCCC CT CCAGGCAAGGGAGAAGTCA 
GAGGC TGAT T CT TGTAGAAT TAACAGCCC T CAACG TGATGAGC TAT GATAACAC TATGAAT TAT G 
T GCAGAGT GAAAAGCACACAGGCT T T AGAGTCAAAGTATCT CAAACC T GAAT CCACAC T GT GCCC 
T CCCT T T TAT T T TT TTAACTAAAAGACAGACAAAT T CC T A 
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FIGURE 86 



■MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPG.PWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLSFCVIFVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAG7UVGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGAT CGCT GGTCCAGT CAGCAGGGCAGCAAAGCAGAC TACC CAGAGGGGGACGGCAAC T GGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAA.CAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GT T T GGCAT CT ACCAGAAATAT CCAGT GAAATATGGAGAAGGAAAG T GTT GGAC TGACAACGGC C 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAAT TCAC TGCGG GAT T TGTT CAGT TCAGGG TAT T TAAT AACGAGAGAGCAGC CAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
p. GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
ggf GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
Q T TGAGAGT T T TGTGGGAGGGAACCCAGACC T C TCC T CCC AACC ATGAGAT C CCAAGGATGGAGAA 

III CAAC TT ACCCAG TAGC TAGAAT GT TAAT GGCAGAAGAGAAAACAAT AAAT CATAT TGAC TCAAGA 

«J AAAAAA 

m 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
S DD YKNPG Y YD I QAKDLG I WHVPNKS PMQHWRNS S LLR YRTDT G FLQT LGHNL FG I YQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAJSTALCAGMRVTGCN 
TEHHC I GGGGYFPEAS PQQCGD FSG FDWS GYGTHVGYS S SRE I TEAAVLL FYR 



Important features : 
N r Signal peptide: 

5; amino acids 1-16 



m 

H 

2 5'2 



N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 



N-myristoylation sites. 



W amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATG GGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTGTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTC TGA TTACCTTCA 
TGACGGGAACC TAAGGAC GAAGC C T ACAGGGGCAAGGGCCGC T TCG TAT T CC TGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGACT TATATACAG TTT TAG G G G AC AAT TAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VE LAG FLSGVSMFNS TQS L I S I GAHCSASVAL S FFI FERWECT T YWY I FVFCS ALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 



m 

I i: \ 



CT GGGAC CCCGAAAAGAGAAGGGGAGAGCGAGGGGAC GAGAGC GGAGGAGGAAGATGCAAC T GAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CAC CC T CAGCCAAGG TGAAGAAAAT CT T T GGC TGGGGCGAC T T C TAC T CCAACAT CAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAAT GCCACAGGCCAGGGAAACATCT CCATCAGCC TC GT GCCCC C CAG TAAAGCT GTAG 
AGT T CCACCAGGAACAGCAGAT CTT CATC GAAGCCAAGGCC TCCAAAAT C T T CAAC TGCCGGAT G 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
\\ T CTACAT CGCC T TC TACAGCACGGAC TAT C GGC TGG TCCAGAAGGT GT GCC CAGAT TACAAC TAC 

H CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
V l TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
JL AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
j|J AGAAAGGGTCCCAAGTGC TGG T CCCAACCT GAAGCT G TGGAGTGACTAGAT CACAGGAGCAC TGG 

p AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
111 TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
M CCT T GGT T CT TGCCATCC T GAGGAAAGATAGCAACAGGGAGGGGGAGAT T T CAT CAGTG TGGACA 

^ GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNI S ISLVPPSKAVEFHQEQQI FIEAKASKI FNCRMEWE.KVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 




N-glycosylation sites. 



2-oxo acid dehydrogenases acyl transferase 



amino acids 62-65, 127-130, 137-140, 143-146 



amino acids 61-71 
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FIGURE Q3 

CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAQTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACT TC CCAAACCGCAGAC TACATC T T TAGAGGAAGCACAACTGTGCCT 
T T T T C T GAAAAT CCC T T T T TC TGGT GGAAT T GAGAAAGAAATAAAAC TAT GCAGATA 
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FIGURE Q4 

MTAAVFFGCAF I AFGPALAL YVFT I AI E PLR 1 1 FL I AGAFFWL VS LL I S S LVWFMARVI I DNKDG 
PTQKYLL I FGAFVS VYI QEMFRFAY YKLLKKAS EGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 

AAT T T T T CACCAGAGTAAACT T GAGAAACCAAC TGGACC T TGAGTAT T GT ACAT T T TGCC TCGT G 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC T ACCAAACCAACAGCAGT CAAAT CAGGTC T T TCC T T CT TT AAGT C T GATACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAAT TT T T GT CACACAAC T T GGAGC C CAGGGCACTAT CC TAAGCT CAGAGGAATTGCCACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
ATT GAT TATAT TATGGAAT AGAT T GAGACACAT TGGAT AGTC T T AGAAGAAAT TAAT TCT TAAT T 
T ACC T GAAAATATTCT T GAAAT T T CAGAAAAT AT G T T CTATGTAGAGAAT C CCAACT T T TAAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
T AAAACATAT T TGGAAAAC TGGAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAZ^^AAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTS QAGANPDVQDG S L PAGGAGVNPAT QGT PAGRL PT P SG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAG.CCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACGTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
» , CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
Q CCAGGCCTCAGGCCACCAGGCCT^CTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CI CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
SI CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
b'j'j TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
; f; AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
^ ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
M CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
Iff ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
J ' CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
% % CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
j»* TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
111 ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
§3 CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
fj[J GCCT GGATTGGGAGCCAGACT C T GGGCATGTTAGAC TCAGACT C TGCAAGAACC TCAG T TATGAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
111 ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACCCAC TGGACCATGT CAAGGAAGAAGGACCCGGATCAAC CAGCAGAC AGCGT GCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
T CCCCT GGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAG T AGAAGCCCC T CCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPG I P FL FLEAS DRDE PGTANS DLRFH I LS QAPAQPS PDMFQLE PRLGALALS PKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVXYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQMSHGEDYAAPLELHVL 
VMDEND3STVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTS GSVTLGVLPLRAGQNI LLLVLAMDLAGAEGGFS S TCE VE VAVTD I NDHAPEFIT S Q I G 
H PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
E AAP S HE WVWQS VAKL VGP G PG P G AT AT VT VL VE RVM P P P KL DQE S YE AS VP I SAP AG S FLL T 
J IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
M QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
SJ REHI IPVWSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGI FLI 

L I FTHWTMSRKKDPDQPADSVPLKATV 



Mi 



Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 7 62-784 
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FIGURE qq 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT T CT TCCCAGGAACACAAACG TAGGAGACCCACGC TCC T GGAAGCACCAGCC T T T A 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT C T AGG ACCC AGGCAT CTTGCTTT CCAGCCACAAAGAGACAGATGAAG AT GC AGAAAGGAAAT G 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAAC T CT GAGT TCCAT ACAACC TCCAGTGGGAT CAGCACAG CCACCAAC TC T GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGC CAGCACAGCC ACCAAC TGT GAGT CCAGCACACCCTCCAGT GGGG CCAGCACAGT CACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
~t*\ AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
}Ej CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
U C CT CCAGT GGGGCCAGCACAGCCACCAAC T CT GAGT CCAGCACAGT G T CCAGTAGGGCCAGCACT 

Si GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG 
|.y AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
\J GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGT GGGGCCAACACAGCCACCAACTC TGAG T CCAGTACGACCT CCAG TGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTCTGACTCCAGCACAACC TCCAG TGAGGCCAGCACAGCCACCAAC T CTGAG TC TAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGTCC T AACTGG TT C TGGAGGAGAC CAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATC T T GAAGAAGGTAT T CC TC ACC TTTCTTGCCTT T ACCAGAC AC T GGAAAGAGAAT AC TATAT 
TGCTCATT TAGC TAAGAAATAAATACATCTCATCTAACACACACGACAAAGAGAAGC TGTGC TTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATC TGGCAT TCAAAATC TCCACAG TAAAA.TCCAAAGACCT CAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



§ m 
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FIGURE 100 

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 101 

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCT GGAAC TATCAGGC T GAAAAACAGAGT GGGTACTC TC TT C T GGGAAGC TGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AAC GAGGCAGT T TC TAGC TAC T GC ACGT ACT T CATAAAGCAGGACT C TAAAAGC TTT GGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GATTATAACTGGGGATGCATTTGTTCCTGGAGAAAGAAGTGTCATTATCATGAACCATCGGACAA 
GAAT GGACTGGATGT TCC TGTGGAAT TGCCTGATGCGATATAGCTACC TCAGAT TGGAGAAAAT T 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
C T T CAT T CAT AG G AAAT GG AAGG AT G AC AAG AG C CAT T T C G AAG AC AT GAT T GAT TAC T T T T G T G 
AT AT T CACGAACCACT TCAACT CC T CAT AT TCCCAGAAGGGAC T GAT CTCACAGAAAACAGCAAG 
TCTCGAAGTAATGCATTTGCTGA7VAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCAC TTT CACGTCCACCG GTAT CCAATAGACACCCT CCCCACATCCAAGGAGGA 
CC T T CAAC T C T GGT GCCACAAACGG T GGGAAGAGAAAGAAGAGAGGC TGCGT TCC T TCT AT CAAG 
GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATAT TT GT ACAGTCT TGTTAAGTGGTAT T TTATAATCACCATTGTAATCTT TGTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 
T TAAAT T CAAAGAAAAAT GAG TAAG AT TATAAGGT T T GCCATG T GAAAACC TAGAGCATAT T T TG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
G G G C C AG AAT AT TAT T AAACAAT CAT C AG G C T T T T AAA 
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FIGURE 102 

MHSRGREIYVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FLWNCLMRYS YLRLEKI CLKASLKGVPGFGWAMQAAAYI FI HRKWKDDKSH FEDMIDYFCDI HEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNI PQSEKHLLQGDFPRE IHFHVHRYP IDTLPTSKEDLQLWCHKRWEEKEERLRS FYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFS PAMCLL I YLYSLVKWYFI I T I VI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 103 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
T CCAAAT CAT CCATCCACCCCT GCTGTCATCTGT TT T CATAGT GTGAGAT CAAC C CACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
I , AGGATT T GTCT TCAGAC TC CAGAGCAAAT GCAGATGGGTACAGC CTGTAT GAT G T GGAGAT C TCC 

AT TAT AG T CCAGGAAAAT GCT GGGAGCATAT TG T GT TCGATCCACCT T GC T GAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
CI TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
If! AAAT CCAAAGGGAAAATCCAGGCGGAAC T GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 

y| AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
Z') TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 

TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGGAGGGAGACATTACTGGGAGGTGGA 
H CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ll\ ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
* ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
m GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
y TGACATGT CAGT TT GAAGGC T TGT T GAGACCC TATATCCAGCAT GCGATGTATGACGAGGAAAAG 

Wl GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC 
Q CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
fft CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
^ TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
|jf : TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA 
111 CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT TCCTGCCTCACAGGTGAAGAT TAAAGAGACAACGAATGTGAATCATGCT TGCAGGTT 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCT TATTCCACAT TAAT TTACTTTTCTCTATACCAAATCACCCATGGAATAGTTAT TGAACACC 
TGCTTTGTGAGGCTCAAAGAATAAAGAGGAGGTAGGATTTTTCACTGATTCTATAAGCCCAGCAT 
TACCTGATACCAAAAGCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AAC ACAGACACAAAAAT T C T AAAT AAAAT T T TAAC AAAT T AAAC T AAAC AATATAT T TA 
AAGAT GATAT ATAAC TAC T CAGTGTGGT T T GT CCCACAAATGCAGAGT TGG T T TAATAT TTAAAT 
AT CAAC C AG T G T AAT T C AG C AC AT T AAT AAAG T AAAAAAGAAAAC C AT AAAAAAAAAAAAAAA 
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mafvlilvlsfyelvsgqwqvtgpgkfvqalvgedavfscslfpetsaei^ievrffrnqfhawh 
lyrdgedweskqmpqyrgrtefvkdsiaggrvslrlknitpsdiglygcwfssqiydeeatwelr 
vaalgslplisivgyvdggiqllclssgwfpqptakwkgpqgqdlssdsranadgyslydveisi 
ivqenagsilcsihlaeqsheveskvligetffqpspwrlasillgllcgalcgvvmgmiivffk 
skgkiqaeldwrrkhgqaelrdarkhavevtldpetahpklcvsdlktvthrkapqevphsekrf 
trkswasqgfqagrhywevdvgqnvgwyvgvcrddvdrgknnvtlspnngywvlrlttehlyft 
fnphfislppstpptrvgvfldyeggtisffntndqsliytlltcqfegllrpyiq'hamydeekg 

M s TPIFICPVSWG 

war? 

Jl Signal peptide: 

yi 

s s a amino acids 1-17 

Transmembrane domains: 

tf- amino acids 131-150, 235-259 
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CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT T CAT TAT GT G AG AT AT AATCAAAAGAAGACC T AC AAT T ACT AT AGC AC AT TGTC AT TT AC AAC 
TG ACAAACT AT ATGCT GAGT T TGGCAGAGAGGCTT CT AAC AAT T TT AC AG AAAT GAGC CAGAG ACT T G AAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTC AACAGAAGCATGGAGT GTT GGC TCAT AT GCT GTT GAT T T GT AGAT T TC ACTCT ACT GAGG ATCC TG A 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CCT C AC TC AGTT AAAATT AAAAAAATC AACAAGACAGAAAC AGAC AGCT AT C TAAACC AT T GCTGC GGA 
ACACGAAGAAGT AAAACT CT AGGT C AGAGT CT CAGG AT CGTT GGTGGGAC AGAAGTAGAAGAGGGT GAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TT GTGAGT GCTGCT C ACT GT TT T ACAAC AT AT AAGAAC C CTGCC AGAT GGACT GCT T CCT TT GG AGT AAC A 
ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA 
^1 TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TCCCTG AT GC AT CCT ATGAGTTT CAACC AGGT G AT GTG ATGT TT GT GAC AGGATTTGG AG CACT GAAAAAT 
GATGGT TAC AGT C AAAAT C ATCT T CG ACAAGC AC AGGT G AC T CT C AT AG ACGCT AC AACT TGCAAT GAAC C 
T C AAGC TT AC AATGACGCCATAACTC CT AGAATGT T AT GT GC TG GCTCCT T AGAAGGAAAAACAGAT G CAT 
GCC AGGGT GACT CT GG AGGACC ACTGGT T AGT TC AGAT GCT AGAGAT ATC T GGT ACCTTGCT GGAAT AGT G 
pi| AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
fjfj GATTACTTCAAAAACTGGTATC TAA GAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 
GGTGTGGAGG CC AT T T T T AG AGAT AC AG AATT GGAG AAGACT TGC AAAAC AGCT AG ATTT GACTGATC TCA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
G ATCAACT CT GT C ATCT GT GAG CAAT AGT T GAAACT TT AT GT AC AT AG AGAAAT AG AT AATACAAT AT T AC 
ATT ACAGCCT GT AT TCATTTGTTCTC T AGAAG TT TT GT CAGAAT T TTG ACTT GT TG AC AT AAATT T GT AAT 
GCATATATACAATTTGAAGCACTCCTTTTCTTCAGTTCCTCAGCTCCTCTCATTTCAGCAAATATCCATTT 
T C AAGG TGC AGAACAAGG AGT GAAAG AAAAT AT AAGAAGAAAAAAAT C CC CT AC AT TTT ATT GGC ACAGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GC AG AATACCAATC ACTT CATC AT T T AGGAAGT ATGGG AACT AAGT T AAGGAAGTCC AGAAAGAAGCC AAG 
ATATATCCTTATTTTCATTTCCAAACAACTACTATGATAAATGTGAAGAAGATTCTGTTTTTTTGTGACCT 
AT AAT AATT AT ACAAACT T CAT GC AATGT ACT TGT T CT AAGC AAATTAAAGC AAAT ATT TAT T TAACATTG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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O 
m 



MMYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AE FGREASNNFTEMS QRLE SMVKNAFYKS PLRE E FVKSQVI KFS QQKHGVLAHMLL ICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTAS FGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYE FQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
AT AAT G T TC T C AC T T TAAC TGCAGAAAC GAGGG T AGAGGAAGC AG T CAT T T T GACT T ACT TT CC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT AT GGT CACT TT GAAAGCCAGGAT GAC AAAT TATGGAT TACC TAGATATCGG T GGC T TAC TCA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
H - AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 

Q CAGGCCCACCAGGAAGATCTCAGTGACCT TTATCAAGAGGGTT GT GGGAAGAAAATGTAT TCCT T 

fl TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
pr= TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
fl ACAGACCAAATGATG TCC T TGAAGAAT GACAAC TCTCAGCACC T G TCAT G T CCC TCAGTAGAACT 

* GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT 
SI T TGAGATGGAGGAGT TATAAAAAGAAATGTCACAGAAGAAAACCACAAACT TGTT TTAT TGGACT 

l~i TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA 
|p TAACACCTAAGCATATACTATTCTATGCT T TAAAATGAGGATGGAAAAGTT TCATGTCATAAGTC 

■ ACCAC CT GGACAATAAT T GATGCCCT T AAAAT GC TGAAGACAGAT GT CATACCCACT G TGTAGCC 

S TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
0 TCCAT GCAAAC GAG TCACATAT GGT G GGAC TGGAGCCATAGTAAAGGT TGAT TTAC T TC TACCAA 

p| CT AGT AT ATAAAGTAC TAAT TAAAT GCTAACATAGGAAGT TAGAAAAT AC TAATAACT T T TAT TA 

S% C T CAG CGAT C TAT T C T T C T GAT G C TAAAT AAAT TAT AT AT CAGAAAAC T T T C AATAT T G G T GAC T 

jf ACCTAAATG TGAT TTTTGCTGGT TAC TAAAATAT TC T TACCAC T TAAAAGAGCAAGC TAACACAT 

*§ TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
EJ T T CAGTT C T GATAATGTTAAGAATAACCAT TATGAAAAGGAAAAT T T GTCC T GTATAGCATCAT T 

||| ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 

T GT TAT T TAAAT AC T T AACCAC TAAT T T TGAAAAT TACC AG T G T GAT ACAT AGGAATC AT TAT T C 
AGAAT GT AGTC T GGT C T T T AGGAAGT AT TAATAAGAAAAT T TGCACAT AAC T TAG TT GAT TCAGA 
AAGGACT T GT ATGC TGTTTTTCT CCCAAAT GAAGAC T CT T T T TGACAC TAAACAC TT T T TAAAAA 
GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
CTT TGT TT TATT TCACTGAT TAATAT AC TGTGGCAAATTACACAGATT AT TAAAT T T TT TTACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TCTCAGAATATGGAAAGAAAATT AAAATGTGTCAATAAATATT TTCTAGAGAGTAA 
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MARE D S VKCLRC LL YALNLL FWLMS I S Vl^VS AWMRD YL^INVLT LT AS TRVEE AV I L T Y FP WH P 
WIAVCCFLIIVGMLGYCGTWRNLLLIAWYFGSLLVIFCVEIACGTOTYEQEIJWPVQWSDMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
E DLS DLYQEGCGKKMYS FLRGTKQLQVLRFLGI S I GVTQILAMI LT I TLLWALY YDRREPGTDQM 
MSLKNDNSQHLSCP.SVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGT GTC ACCCGT GGCCCT AGATCCTTGTTCT GCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAAT GGGAAC T GCT GTCTCT GGAACACCAC GGTGGAAGTC AAGGCT T GCCCTGGAGGCT ACT ATGTGTATCGT 
L.L CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
Q C AGCT GCT CAGAT ACC AGC GAGT GCACATGCGCTCCAGGAACT GTGC T AGGCCCTGACAGGCAGACAT GCTTT G 

Q AT GAAAAT GAAT GT GAGC AAAAC AAC G GTGG CT GC AGT GAGAT CT GT GT GAACCT C AAAAACT C C T AC C GCT GT 

fft GAG T GT GGGGT T GGCC GT GT GC T AAGAAGT GAT G GC AAGAC T T GT GAAGAC GT T G AAGGAT GCC AC AAT AAC AA 

iUJ TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
SI CT G AGGAT AAC C AC ACT T GCCAAGT CC C TGT GT T GT GC AAAT C AAAT GCC AT TGAAGT GAAC AT C C C C AGG GAG 

P : CT GGT T G GT GG CC T GGAGC T CT T CC T G ACC AAC AC CTCCTGCC GAGGAGT GT CC AAC GGC AC CC AT GT C AAC AT 

W CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA 
^ CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
y ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 

CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAAT GCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAATCAGAAGCTGGGTATAATATTTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACTGAAATTATGA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
T TAT AGG GAAT T T GGAAGT GT AT C AAT AAAACAGT AT AT AAT T T T 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
•TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LR S DGKTC E D VE GC HNNNGGC S H S CLGS E KGY QC EC PRGL VL S E DNH T CQ VP VLCKSN AI EVN I P RE L VGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites . 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxy la tion site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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G AGAG AGGC AGC AGCT TGCTC AGCGGAC AAGG AT GCTGGGC G TG AGGG AC C AAGGCCT GC CC T G CACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AG AC CT TC AG AAAG GT GGGG AT CCCC AT CAT C AT AGC ACT ACTGAGCCT G GCGAGT AT C AT C AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGC AGCT GT GT GACGGAGAGCTGGACTGT CCCT TG GGGG AGGACG AG GAGC AC T GT GTC AAGAGC TTC 
CCCG AAGGGC CT GCAGTGGC AG TCCGCCT C TCCAAGGACC GATC C ACACT GCAGGT GCTGGACT CGGC CAC 
^* AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCT AC AGC AG AGCT GT GGAG AT TG GC CC AGACC AGGAT CTGG AT GTTGTT GAAATC ACAGAAAACAGC CAG 
J! GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
Z. 3 TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
%j TC AGCAT CC AGT ACG AC AAACAGC AC GT CT GT GGAG GGAGC AT C CT GGACCCCC AC T GGGTCCTC ACGGC A 

jU GCCC ACT GCT TC AGGAAAC ATACCGATGT GT T CAAC TGGAAGGT GC GGGC AGGC TC AG AC AAACTGGGCAG 

Iff . CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
S TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
is! GATG AGGAGC TC ACTCC AGCC ACCCC AC T C TGGATC ATTG GATGGGGC T T TACGAAGCAG AATGGAGGGAA 

W- 1 GAT G TC TGAC AT ACTGCTGC AGGCGTC AGT CCAGGT CAT T GACAGC AC AC GGTGCAAT GC AG ACGATGCGT 

Cv 

J;, ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 

2J GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

£"2 CTGCGG GGGC CC GAGC ACCCC AGG AGT AT ACACCAAGGT C T CAG CCTATCT C AACT GG AT CT AC AATGTCT 

m 

GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCT GCCC AC AGCCTC AGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
C AGGGGT AT T GC T AAGCC AAGAAGGAACTT TCCC AC ACT ACT GAAT GGAAGC AGGC T GTC T T GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC T AGAGCAAG AAAC C AGTT GT AATAT AAAAT GCAC TGCC CT ACT GTT GGT AT G ACT ACCGT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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MLQDPDS DQPLNSLDVKPLRKPRI PMET FRKVG I P 1 1 1 ALLS LAS 1 1 I WVL I KVI LDKYY FLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 



APPJD=10063715 



Page 266 



FIGURE 113 



m 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
Q TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
III C TCC T TCCCT C TGAGAGGCCCT CCT AT GT CCCTAC TAAAGCCACCAGCAAGACATAGC TGACAGG 

1*1 GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACCAAAGGAGCTAGAGCTTGGTT 
CAAAT GAT CTCCAAGGGC CC TT AT ACCCCAGGAGACT TT GATT T GAAT T T GAAAC CCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAT^CCCCTGTCTCTACTAAAAATACAAATVAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCACT TCAGCCTGAGCAACACAGCGAGACTC TGTC TCAGAAAAAAT AAAAAAAGAAT TA 
TGGTTATTTGTAA 



vn 
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FIGURE lid 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



ft:::;? 



IS* if 

ru 
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FIGURE lis 



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAATCCT CCAGAGAATTGTGAAGACTGT CACATTC TAAATGCAGAAGCTT TTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
G TT T T GGGGGAGGAAGCAC T TC TGGC CGGAGGTACCCAAAAAAGCC T ATGACAT GGAGCACACT T 
T C T AC AG C AAT GG AGAG AAG AAG AAG AT T T AC AT G G AAAT T GAT C C T G T G AC C AG AA.C T G AAA.T A 
T T CAGAAGCGGAAA.TGGC AC T GAT GA7\AC AT T GGAAGT GC ACG AC T T T AAAAACGGAT AC AC TGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
M CTGAACCAGAAGAGGAAATAGATGAGAAT GAAGAAAT TACCACAACT TTCTTTGAACAGTCAGTG 

pf AT TT GGGT CCC AGCAGAAAAGC C TAT T GAAAACCGAGAT T T TC T T AAAAAT TCCAAAAT TC TGGA 

S GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
Ijl AC T T T GAG GAGGAG G GAGAAGAT C T T CAC T T T C C T GC C AAC GAAAAAAAA.G G GAT T GAAC AAAAT 

M GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
AGAACTT CCAATAAAT GAC TAT ACT GAAAA.TGGAAT AGAAT T TGAT CCCAT GC T GGAT GAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
f»j GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
||1 TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
CI TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 

GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
TTATCACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATC ATG TCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCAGCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACGT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 

a ; 

J* GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
f§\ GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
h=| T GGG TC C AAC ACCAAAAACAAG AAGATAT ACG AT GG AG GT GCCC GC AC AG AGGACG AG GT AC AATCTT ATC 

%l CTT CCAAGC ACG ACT ATGT G TAAT GCTCT AAGAC CT CT C AGC AC GGGC GG AAGAAACTC CCGGAGAGC T C A 

jM< CCC AAAAAAC AAGG AGAT CCCATCT AGATT TCT T CT TGCT TTTGACT C AC AG CT GGAAGTTAG AAAAGCCT 

U ! CGATTTCATCTTTGGAGAGGCC AAATGGTCTTAGCCTC AGTCTCTGTCTCTAAATATTCC ACCATAAAACA 

GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 

W TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 

111 

TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
21 GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
m ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
ps ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT C ACAC C TGT AAT CCT AGC ACTTTGGG AGGC TG AGGAGG AAGG AT CACT TG AGCC CAGAAGT 
TCGAGAC T AGCC T GGG CAAC AT GG AG AAGCCCTGTC TCT AC AAAAT AC AG AG AGAAAAAAT C AGCC AGTC A 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TG GGGC TGC AGT GAGC CATG ATCAC ACC AC TGC ACT CCAGCC AGGT GACAT AGC GAG ATC CTGT CT AAAAA 
AATAAAAAAT AAAT AATG GAACAC AGCAAGTCCT AGGAAGT AGGTT AAAACT AAT T CT TT AA 
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FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPi^LQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFANML VTNFWMS TANMYTGMGGMVQT VQTRYT FGAAL FVGW VAGGLTL I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
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FIGURE HQ 

GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGT GCAC GGGGGACAATGAGAAG GT GAAGGCT CACATTCTGCTGACGGCTGGAATCATCTT CAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACT CAATAGTGAATGT T GCCCAAAAACGTGAGC TTGGAGAAGC TCTCT ACTTAGGATGGACCACGGC A 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACT C GATACC TT CCC AT CGCACAACCCAAAAAAGT TAT CACACCGGAAAGAAGT C ACCGAGC G 
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAAT CTAT ATTAC TTT CTCAAAATGGACCCCAAAGAAAC TT TGATTTAC TGTTCTTAACTGCCT 
AAT CTTAATTACAGGAACTGT GCATCAGCTAT TTAT GAT TCTATAAGCTAT T T CAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCAT TATGTACATAGATGAGT GTAACAT T TATATCTCACATAGAGACAT GCTTAT AT GGT 
TTTATTTAA^TGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
TTTAT AATGAAGATT AAAAT GAAGGCT TTAATCAGCATT GTAAAGGAAAT TGAATGGCTT TCT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT AC AT T TAT AT TAAT AAAT T GTACAT T T T T C TAAT T 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
I YDS LLALS PDLQAARGmCAAS VMS FIAF 

MWL I PVS WVANAI I RDFYNS I VNVAQKRELGEAL YLGWT TALVL I VG GAL FCC VFCCNEKS S S Y 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 



0 s 



Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
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FIGURE 121 

GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATT CAAAGGAGAAAAGGGGGAATG T CT GAGGGAAAGCT T T GAGGAG T CC TGGACAC CCAAC TAC 
jU AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
0 TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
W GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 

fin 

«% AT T GAAGCT ATAAT T TAT T TGGACCAAGGAAGCCC TGAAATGAAT T CAACAAT TAATAT TCATCG 

m 

\\ CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 

\& TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 

Hi ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 

* T TGGAAT GGTTCACTTAAATGACATT TTAAATAAGT TTATGTATACAT CT GAATGAAAAGCAAAG 

?S C TAAATAT G T T T ACAGAC C AAAG T GT GAT T T C AC AC T G T T T T T AAAT C TAG CAT TAT T CAT T T T G 

m CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 

fjjj CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 

G AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT 

11! 

T AAAT AAAAAT TAT T T CCAAC A 
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FIGURE 122 

MRPQGPAAS PQRLRGLLLLLLLQLPAPSSASE I PKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
S VE GLCEG I GAGLVDVAI WVGTCS DYPKGDAS TGWNS VSR 1 1 1 EELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG 
TAAT TAGGC T AC TGAGCGGATCAGCT GTAGCAC TGG T TATAGCC CCCAC T G T C T TACTGACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ff- AT AACAGCC TT CAAAAAC T TAAGTATAAT CAAT T T AAAGGGCT CAAC CAGCT CACC TGGC TATAC 

Q. CT TGACCATAACCATATCAGCAATATT GACGAAAATGC TT TTAATGGAATACGCAGACTCAAAGA 

O GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
m TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 

CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 



H T C TT T GCT GGCATGATCAGACTCAAAGAAC T TCACC T GGAGCACAAT CAAT T T TCCAAGC TCAAC 



CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCC TGGACC T GGAGC TCC T TACAAAGGC TTGAT T T ATCAGGCAAT GAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
ATCGAT GCAGT GAAGAAC TACAGCATC TG TGGCAAAAG TAC TACAGAGAGGT T TGATC TGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
fl| CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAGTC CC TAAAGCAAATGAC T CCCAGCACCCAGGAATT T TATGTA 
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGT ATGAA CCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGT CAT T T T CC T C TCATAC AT AAT CAACCCAT T GAAAT T TAAAT ACC ACAAT CAAT GTGAAGCT T 
GAAC T CCGGT T TAATATAATACC T AT T GTATAAGACCCT T TAC T GAT TCCAT TAATGT CGCAT TT 
GTT T TAAGAT AAAAC T TC T T TCAT AGGT AAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNS LQKLKYNQFKGLNQLT WLYLDHNHI SN I DENAFNGI RRLKE L I LS SNR I S YFLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHI S FHKI I AGS VAL FLSVLVI LLVI YVS WKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of" the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360/ 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGA.GGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATT GAAGAGGAAGGCT TTC TAAAGCT TTGGCAAGGAGTGACACCCGCCAT TTACAGACACGTAGT 
GTAT T CT GGAGG TCGAAT GGTCAC AT ATGAACATCT CCGAGAGGTTG T GT T TGGCAAAAGT GAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAAT CCAAC T GACC TAGTGAAGGT T CAGATGCAAATGGAAGGAAAAAGGAAAC TGGAAGG 
CI AAAACCAT T GC GAT T TCGTGGTGT ACAT CATGCATT TGCAAAAATCT T AGC T GAAGGAGGAATAC 

GAGGGC T T T GGGCAGGCTGGGT ACCCAATATACAAAGAGCAGCAC T GGTGAATATGGGAGAT TTA 

ffi 

|' !■ ACCACT TAT GATACAGTGAAACACTAC T T GGTAT TGAATACACCACT T GAGGACAATATCATGAC 

\l TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
H- TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
ift ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
!L„ AT CT TGGC T GAGAATGACC CC T T GG T CAAT GGTGT TC TGGC T T AC T TAT GAAAAAAT C AGAGAG A 

g» T GAGT GGAGTCAGTCCAT T T TAA 

:3s 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHWYSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANPT DLVKVQMQME GKRKLEGKP LRFRGVHHAFAKI LAEGG I RGLWAGWVP 
NI QRAALVNMGDLT T YDTVKHYLVLNTPLEDN IMTHGLS S LCSGLVAS I LGTPADVI KSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 



is) 



APP ID=1 00637 15- 



Page 280 of 322 



FIGURE 127 



P 



w 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCC ATGG CCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCG 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAAC7VATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGC T TCGGGAGGCCCAGACCCACATCCGAGCCACGT GCAT C T C T CCAGGT G TGG TGG 
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGC TGTTATCTACGT CC TCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCC T GGC CCAGT GGAT T T CATGGT GATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 

MARPGMERWRDRIAiVTGASGGIGAAVARALVQQGL 

PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites . 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



m 



AACTTCTAC ATG GGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAAT-TTCTCATGTATTTTTCTCATTTATTTA 
T T AAT T T T TAAC TGATAGT T GTACATAT T T GGGGGTACAT GTGAT AT T T GGATACAT GTATACAA 
TATATAAT GAT CAAATCAGGGTAAC T G GGAT AT C CAT C AC AT C AAAC AT T TAT T T TTTATTCTTT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGC CCAACTAAT T T T T GT AT T T T TAG TAGAGACGGGGT T T TGCCAT G T T GCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
TT CCATCCATGTTGC TGCAAAT GACAGGATTTCGTTCTTAATTTCAAT TAAAATAACCACACATG 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 8 6-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lai 



iij 



T TCT GAAGT AACGG AAGC TACC TT GT AT AAAGAC CT C AAC ACTGCTGACCATGATC AGCGC AGCCT GG AGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGGAT GC T AC AACTC T CT ACC TTCAG AAC AACC AAAT AAAT AAT GCT GGGATT CCTT CAGAT 
T T GAAAAAC T TGCT GAAAGT AGAAAG AAT AT ACC T AT ACC AC AAC AGT TT AG AT GAATTT CCTACC AACCT 
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA 
AAATTCCCT ATCTGG AAG AATT AC ATTT AG AT GAC AAC T C TGTC TCTGCAGT T AGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
H GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG 
5*? GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 
!!!f TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

01 

T CCAGGC AC AAACCT G AGGAAGCT T T AT CT TC AAGAT AAC CACATC AATC GGGT GCCCCC AAATGCTT T TT 
CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 
|.l GATGATTTGGACAATATAACACAACTGATTCTTCGCAACAATCCCTGGTATTGCGGGTGCAAGATGAAATG 
||1 GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
g AGGT TCGT GGGATGGC TAT T AAGGAT CT C AATGCAGAACT GT T T GAT T GT AAGGAC AGTGGG ATTGT AAGC 

O ACC ATT C AGATAACCACT GC AATACCCAAC AC AGT GT AT C CT GC CC AAGGAC AGTGGCCAGC TCCAGT GAC 

III C AAACAGCC AGAT AT T AAGAACCC C AAGCT C ACT AAGGAT CAAC AAAC CACAGGGAGTCCCT CAAG AAAAA 

. 

~™* CAAT TAC AAT TACTGT GAAGTCTGT C ACCT C TG AT ACC AT TCAT AT C T CT TGGAAACT T GCT CT ACCT ATG 

ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 
AACAGGGG AACG CAGT GAGT ACTT GGTC AC AGCC CT GG AG CCT GAT T C ACCCT ATAAAGT AT GC AT GGTTC 
C CAT GGAAAC CAGC AACC T CT ACCT ATT T GAT GAAACT CCTGT T TGTAT T GAGACT GAAACT GC ACCCCTT 
CGAATGT ACAAC CCT AC AACCACCCT C AAT CG AGAGC AAGAGAAAG AACC TT AC AAAAAC CCCAAT TT ACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
T TCATAGG AATG GATCGC T CTT CT C AAGGAACT GTGC AT AT AGC AAAGGG AGG AGAAGAAAGGATGAC TAT 
GCAGAAGCT GGC ACT AAGAAGGAC AACT CT AT CCTGGAAATC AGGG AAAC TTCT TTTCAG AT GTTACC AAT 
AAGCAATGAACC C AT CT C GAAGGAGG AGT T TGT AAT AC AC ACC AT ATT TC CTCC TAAT GG AAT G AATC TGT 
AC AAAAAC AATC AC AGTGAAAGC AGT AGT AACCG AAGC TAC AGAGACAGT GGT ATT CC AG AC TCAGAT C AC 
TC AC ACTC ATGAT GCT GAAGGACT C ACAGC AGAC TTGTGTTTTGGGTTTTT T AAAC CT AAGGGAGGTG ATG 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
E E LHLDDNS VSAVS I EE GAFRDSNYLRLL FL S RNHL S T I PWGL PRT I EE LRLDDNR I S T I S S P S L 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGI FDDLDNI TQL ILRNNPWYCGCKMKWVRDWLQS LPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
I KNPKLTKDQQT TGS P S RKT I T I TVKS VT SDT I H I S WKLAL PMTALRLS WLKLGHS PAFGS I TE T 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
O PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKD 
O I LE I RETS FQMLP I SNEP I SKEE FVIHT I FPPNGMNLYKNNHSESS SNRSYRDSGI PDSDHSHS 

SsS 

Important features of the protein: 
1=4 Signal peptide: 

amino acids 1-28 
s - Transmembrane domain: 

CI 

amino acids 531-552 

HI 

$*% N-glycosylation sites. 

||| amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 

t ; l Tyrosine kinase phosphorylation site. 

amino acids 515-522 

N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 



APP ID=1 006371 5 



Page 286 of 322 



FIGURE 13 3 

CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCGCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGAGCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCAC TCCAACAGAGCCAGGACT ATAT CAACC TCT T CT GCGCCAACATGAT GGAC T T GAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
AACAGCAGGAAGGAT GCT T CGGGGAGCC TGATGCTGAAGAT GAAGAAT TATC TAAAGC TAT TCAA 
TATCAGCAGCAT T T T TCGAGGAGAGT GAAGAGGCGAGAAAAACAAT T T CCAGAT TC TCGC TC TGT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCT GAAAAGCAC T T T CAAAGAGAC T G T G T TGAATAAAGGGCCAAGGT TCT T GCCACCCAGCAC T C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
C C AC CAC C AAG C AGC C GC T GAGAC GGAC G G T T C CAT GC C AGCT G C C T G GAGG AGGAAC AGAC CC C 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRR^CRGSLGIARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT IADL ILSALERATVFLEQRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAI QRS LHYLKLS DPKYLRE FQLTLQPGFWKL PHAW I HTDAS LVYPT FGPQDS FSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCAKMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



hi 
SI 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 



Iff amino acids 39-56 



Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGAT TTGATCCCGTTCGAGTGGATATCAC TTCGAAAGGAAAAAT GAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
y, CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
Q TATGATGATGGTTCTTCGTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
J~f C T GAC AT GAGACGGGAAAT GGAGCAGT CAATGAATATGC TGAAT TCCAACCATGAGT T GCCTGAT 

01 

B ' 3 GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 

'%l TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC 

Jsfe AAACACGGCAACAC T GGGTGGCATCCAAGT CT TGGAAAACCGT GT GAAGCAACTACTATAAACT T 

111 GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

^. AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 

29« GAGGTCAATATTGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCATTAATAAATTAT 

C| ATGAACTACT ATACAT TAT GTAT AT T AAT T AAAACATCT TAATCCAGAAAT CAAAAAAAAAAAAA 

CO AAAAAAAAAAAAAAA 

1*1. 

ry 
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FIGURE 1*6 



3XIAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAVVPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFmNPMVMMiy^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Ami da ti on site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 127 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
C CAGAAGGGTGAGC T ACGT TGGCTT TCTGGAAGGGGAGGC TATATGC GTCAAT T CCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAAC TCGCACT CAGACTCT GAACTCAGAC C T GAAATC T T CT CT TCACGGGAGGC T TGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CT T C T CT C TGC T GCGT T T TATC T CCT AT GGAC T CCT T CCAC TGGACT GAAGACAC TCAAT T T GGG 
g , AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG 
p T GCAAGCCAAAGAT GGAAACAT T GACAT CAGAATC T TAAGGAGGAC T GAG T CT T T GCAAGACACA 

0 AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
fi CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
gj AAGAAAT ACAGCCAGATTCTGAGTCAC T T T GAAAAGC TGGAACC T CAGGCAGCAGTTGT GAAGGC 

Iff TT TGGGGGAACTAGACAT TCTTCTGCAAT GGATGGAGGAGACAGA ATAGG AGGAAAGTGATGCTG 

■ C T GCTAAGAATATT CGAGGTCAAGAGC T CCAG TC TT CAAT ACCT GCAGAGGAGGCAT GACCCCAA 

~£ ACCACCAT CTC T TTAC TGTACT AGT C T T GT GC TGGT CACAG TGT AT C T TAT T TATGCAT TAC TTG 

j£t CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
m T G T AAT AT C T T T C T GCTAT T GGAT AT AT T TAT T AGT T AATATAT T TAT T TAT T T T T TGC TAT T T A 

Q . ATGTAT T TAT T T T T T TAC T TGGACAT GAAACT T TAAAAAAATT CACAGAT TAT AT T T AT AACC T G 

i|| ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG 

CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
AT TT GAAAT TGAACCAATGACTACT TAGGAT GGG T TG TGGAAT AAGT T T TGATG T GGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE is8 

MRQFPKTS FDI SPEMS FS I YSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHS DSELRPE I F 
S S RE AWQ F FL L LW S P D FRPKMKAS S LAFS L L S AAF Y L L W T P S T G LKT LNL G S C V I ATNLQE I RNG 
FS E I RGS VQAKDGN I D IRI LRRTES LQDTKPANRCCLLRHLLRL YLDRVFKNYQT PDHYTLRKI S 
SLANSFLTIKKDLRLSHAHMTCHCGE 
E 



Important features of the protein: 
Signal peptide: 

Q amino acids 1-42 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

5....;., 

Ill N-myristoylation sites . 

JL amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTGGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCeCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCT T TTAT GGAGATGAAACCTGGGTTAAATTA 
T T CCCAAAGCAT T T TGT GGAAT AT GAT GGAAC AACC T CAT TTTTCGTGT CAG AT T AC AC AGAGG T 
GGAT AATAAT G T CACGAGGCAT T TGGATAAAGTAT TAAAAAGAGGAGATT GGGACAT AT T AATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGCTGAGCGAGATGGACAGCGT GCTGATGAAGATCCACACCT CACTGCAGTCGAAGGAGAGAGA 
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGT GATAT CCGACATC CAAAGCACG T CCAATAGACGGATGT GGC T GCGACAC TGGC GATAGC 
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
CAAT GAGAGAGCAG T TGAGAT T T TT ACAT T TGAATACAGT GCAGCT TAGT AAAC T GT TGCAAGAG 
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
GAAC TGGAT CAGAC T G TAC T T GGAGGAAAAGCAT TCAGAAGTCC TAT T CAACCTGGGC TCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
* TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Q CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 

CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
C7\AGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCCTCCCAGCAGAGG T GT GGGAGCT GCAGC TGAGGGAAGAAGAGACAATC GGCC TGGA 
CAC T CAGGAGGG TCAAAAGGAGACT TGGT CGCACCAC TCATCC T GCCACCC CCAGAAT GCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACAC TCACAGC T T T GT CATCAGGGCACAGGC T TC C TCGGAGCCAGGAT GATC TGT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGC AC ACAGTATG TAGT TAC CAAAAGAAT AAAC GGCAATAAT T GAGAAAAAAAA 
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FIGURE 14Q 

MRLGS GT FATCCVAIEVLG IAVFLRGFFPAPVRS SARAEHGAE PPAPE PSAGAS SNWT TLPPPLF 
SKWIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASH^ 

VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 



Important features of the protein: 
Signal peptide: 

amino acids 1-34 



01 Transmembrane domain: 

amino acids 58-76 

Si . . 

■ r- 

■3Mj N-glycosylation sites. 

m ■ ■ ■ .' 

% amino acids 56-60, 194-198 



M* N-myristoylation sites. 

m 



amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
sf! 275-281, 278-284 



I'll 



Amidation site. 

amino acids 154-158 



Cell attachment sequence. 

amino acids 205-208 
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FIGURE 141 

GGCACGAGGCAAGCCT TCCAGGT T ATCGTGACGCACC T TGAAAG TC TGAGAGCTAC TGCCC TACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC 
C CGAAG AAAT T C AGCATTCATGACCAGGAT CACAAAGTAC TGGT CCT GGACT CTGGGAATC TCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
T G T GACAAGGATAAAGGACAAAGTCATCCATCCC T T CAGC TGAAGAAGGAGAAAC TGATGAAGC T 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
Q ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
£3 GT T GGGGT GACAGATAAAT T TGAGAACAGGAAACACAT TGAAT T TT CAT T T CAACCAGTT T GCAA 

f| AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT T T G AAC T AAT T G T AT AAAAAC AC C AAAC C T GC T C AC T 
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FIGURE 142 



o 



b, 5 



% 

m 



MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLE SAAHPGWFI CT SCNCNE PVGVTDKFENRKH IE FS FQPVCKAEMS PS EVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 14 ft 

C TAGAGAGTATAGGGCAGAAGGAT GGCAGATGAGTGAC TC CACAT C CAGAGCT GC CT CCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGAGTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
gj GGGGAGAAC TAGGTGT GGGAAAT T T GAAGACGACAT TGACAACT GCCATT TCCAAGAAAGCACAG 

Cl AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
R fl C TCC T GAACAAGACC T GC T T GGAGGGAT T CCAC TGAGTGAAACCCAC T CACAGGCT TGTCCATGT 

w 

i| GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 

f^fe TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 

HI ACACAT CT TGAGCCTAATCATGTAG T GTAGAT CAT TAAACATCAGCAT T T T AAGAAAAAAAAA&A 

few 

if! 
CI 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE ij.pl 



CT GT GCAGCTCGAGGC TCCAGAGGCACAC TCCAGAGAGAGCC AAGGT T CT GACGC GATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
f| TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
CI AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 
Yi CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
■S ] TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
1^ CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
If! GAGAAGAACACATCAGGCAC TGCGCCACC TGCT TCACAGTAC T T CC CAACAACTCT T AGAGGTAG 

* GTG TATT C C CGT TT TACAGATAAGGAAAC TGAGGCCCAGAGAGC T GAAGT AC TGCAC CCAGCAT C 

^ ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
S| GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACGCACATCCAAACATGTAT 
gfj C TC TAAT GAAAT TGT GAAAGCT CCATGTT TAGAAATAAATGAAAACAC CT GA 

£*) 
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FIGURE 146 

MRKHLS WWWLATVCMLL FSHLSAVQTRGI KHRI KWNRKALPS TAQ I TEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VL WRLVQE LC S LKHC E FWL E RGAGL RVTMHQP VL LC L L AL I WLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCAGTGAGTCTTAGTTTTTATTTTTTGAAATTT 
C AAC T T T CAGAT TCAGGGGG T ACAT G T G AAGG T T T GT T T TATGAGTAT AT T GC ATGAT GCT GAGG 
TTTGGGGT 



%A ■ 



«r - 



III 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGA ATG GGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
G CAGAAGAAGAC AAG AT T AAAAAAACAT AT C CT C CAGAAAACAAGC CAGGT CAGAG CAAC TAT T CT T T 
T GT T G AT AACT T GAAC C T GC T AAAG GC AAT AAC AG AAAAG G AAAAAAT T G AG AAAG AAAG AC AAT C T A 
M; TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATGGAAAACTG 
|3 AT C GAT GAT TAT G AC T C T AC T AAG AGT G GAT T G GAT C AT AAAT T T CAAG AT GAT C C AGAT G GT C T T C A 

Ill TCAACTAGACGGGACT CCTT TAAC CGC TGAAGACAT T GT CCATAAAAT C GCTGCCAGGAT TTATGAAG 

AAAAT GAC AGAGC CGT GTT T GACAAGAT TGT T TCTAAAC TACT TAATCT CGGC CT TATC ACAGAAAGC 
CAAGCACAT ACACT GGAAGATGAAGTAGCAGAGGTT T TACAAAAATTAAT CTCAAAGGAAGCCAACAA 
|,|^ TTAT GAGGAGGATCCCAATAAGCCCACAAGCT GGACT GAGAAT CAGGCTGGAAAAAT ACCAGAGAAAG 

If] TGACTCCAATGGCAGCAATTCAA.GATGGTCTTGCTAAGGGAGAAAACGATGAA?ICAGTATCTAACACA 
i T TAAC CT T GAC AAAT G GC T T GGAAAGGAGAACTAAAACC T ACAGT GAAGACAAC T T T GAG GAAC T CC A 

W ATAT T TCC CAAAT T T CTATGCGCT ACT GAAAAGTAT T GAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 

i 3f!E 

fj CACT GAT TACT AT CAT GAAAACACTGATT GACT TTGTGAAGAT GATGGTGAAATATGGAACAATATCT 

y 

C CAGAAGAAGG T GT T TCC T AC CT T GAAAACT T G GAT GAAAT GAT T G C T C T T C AGAC C AAAAAC AAGCT 
AGAAAAAAATGCTACT GACAATATAAGCAAGCT TTT CCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
f|| CAGACAGT ACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATATGGAAGCTTGAAGGAT T CCACAAAA 

GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAA.GGAAAAACAGAAGCCTATTTGGAAGC 
CAT C AGAAAAAATAT T GAAT GGT T GAAGAAAC AT GACAAAAAGGG AAAT AAAGAAGATT ATGAC C T T T 
CAAAGAT G AGAGACT T CATC AAT AAACAAGC T GAT G C T T AT GT G GAGAAAGGC AT C C T T GACAAG GAA 
GAAGCCGAGGCCATCAAGCGCATT TATAGCAGCCTGTAAAAATGGCAAAAGAT CCAGGAGTCTTTCAA 
C T G T T T C AGAAAACAT AAT AT AGC T T AAAAC AC T T C T AAT T C T G T GAT T AAAAT T T T T T G ACC C AAG G 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
G TAAAAAC TAT C T GAAAGTAAAGT TGT AT GTAAGC T GAAAAAAAAAAAAAAAAAAAA 
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FIGURE ISO 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAI RKN I EWLKKHDKKGNKEDYDLSKMRD FI NKQADAYVEKG I LDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 
220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCT CGAGGC T CC C GCCAGGAGAAAGGAAC AT T C TGAGGGGAG T C TACAC C C T G TGGAGC TCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAAT AACCAGCT T C TAGC TGGAGGGCT G CATGCAGGGAAGGT CAT TAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
T GT CAGACT CACCCAGCT T CCC GAGAATGGT GGCTGGAATGCCCCCAT CACAGAC T TC TAC T TCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
J* CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
O TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
P GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
ffl GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
Z s l TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 

GAAAT GGCTCGAGC TCAGAAGATAAAAGATAA.GTAGGGTATGC T GAT CCTC TT T TAAAAACCCAA 
y GATACAATCAAAATCGCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
|M AGAC C TAC T TACAAAGTGGCAT ATAT T GCAAT T TAT T T TAAT TAAAAGATACCTATT TATATAT T 

TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT T C C T T G T AT AAAAA T AAG AAAAG AAAT TAAT C T T GAG G T AAGC AG AG C AG AC AT CAT C T C T G A 
14 TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
|pl T TGT AGTAGTGAT CAGGAAACAGAT CT CAGCAAAGCCACTGAGGAGGAGGC TGTGC TGAGT T TGT 

Q GTGGCTGGAATCTC TGGGTAAGGAACT TAAAGAACAAAAATCAT CTGGTAATTC T TTCC TAGAAG 

|S GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
jf|J GACT GGTT T CC T TG TATGAAAAGGAGAGGACACAGAGACAGAG GAGACGCGGGGAAGAC TATG TA 

AAGAT GAAGGCAGAGATC GGAGT TT T GCAGC CACAAGCTAAGAAACACCAAGGAT T GT GGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
C TGAAACC T TAATC T CAGAC T T C CAGCCT CC TGAAC GAAGAAAGAATAAAT T TCGGCT GT T T TAA 
GCCACCAAGGATAAT T GG T TACAGCAGCT C TAGGAAAC TAAT ACAGC T GC T AAAAT GATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
T AGAATAT GGCAGAAGTGATGGCAT GC CACT TCCAAGAT TAGG T TATAAftAGACAC TGCAGCT TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAA.TCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACAT TAGTGAACC T AGAAGCAGAGAC TC TG TGAGATAATC GATGT T TGT TGT T TT 
AAGT T GCT CAGT T T TGGT C TAAC T T GT TATGCAGCAATAGATAAATAATAT GCAGAGAAAGAG 
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FIGURE 152 

MVLSGALC FRMKDS ALKVL YLHNNQLLAGGLHAGKVI KGEE I S WPNRWLDAS L S P VI LGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCGATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
, i TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC 
1^1 TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT 
Q GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
51 CAATTAGAT GCCCCAAAGCGAT TTTT TTTAACCAAAAGGAAGATGGGAAGCCAAAC TCCATCATG 

gel 

ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 

; S;, ; I 

s ' ? - AGACC AGAAGGTAGAC T T TC TAAGCATAGATAT T TAT TGATAACAT T T CAT T GTAACTGGTG T TC 

h? T ATACACAGAAAACAAT T TAT T T TT TAAATAAT TGTCTTTT TCCAT AAAAAAGAT TAC T T TCCAT 

s TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 
Cl TGTAT TTAT TATTAT TATAAGACTGCATT TTAT TTATATCATT TTAT TAATATGGATT TAT TTAT 

AGAAACAT CAT T CGATAT T GC TACT T GAG TGTAAGGC TAAT AT TGAT AT T T ATGACAATAAT TAT 
52 . AGAGC TATAACATGT T TAT T TGACC T CAATAAACAC T TGGATATCCC 

m 

s 
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FIGURE ISA 

MAALQKSVSSFLMGTLATSCLLLIALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASL 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
T CH IEGDDLH I QRNVQKLKDTVKKLGES GE I KAI GELDLL FMSLRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 155 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACT TGTGACTGAGTGT GCAGTGCCCAGCATGTACCAGGT CAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGGACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACGGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGGCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAG CCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGATCCCGGGACAGGATGGGGGGC T TT GGGGAAAAGC T GCAC T TC TGCACAT T TTGAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGGTGGTGTCCTGTCATTTTCTCTCAGG7\AAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCGCCTGCTACCCTG 
GCCCAGCACAGGCAC TTTC TAGATATT TCCCCCTTGC TGGAGAAGAAAGAGCCCC TGGTTTTATT 
TGTT T GTT TAC TCATCAC T CAGT GAGCAT CTAC T T T GGGT GCAT T CTAG T G TAGT TAC TAG T CT T 
TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE isz 

CCGGC GATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
C CGGAGACT TGAGGGACC TCCGAGTAGAACCTG T TACAAC TAG T GT T GCAACAGGGGAC TAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
CCCT T CCAT GTC TGT GAAT T TCACCT CACCAGGC TGCC TAGACCACAT AAT GAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
TATCATCGGGT T T TC T CAGG TGT T TGAGCCACACCAGAAGAAACAAACGCGAGC T TCAGTGG T GA 
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGGTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAAT T TGT TAC T TCACTGAAT T TCT TCAAAACCAT T GCAGAAG T GAGGTCATCC TT GAAA 
AGT GGCAGAAAAAGAAAAT AGCAGAGATGGGTCCAGT GCAG TGGC TT GCCAC TCAAAAGAAGGCA 
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGAT CT AAGAAGCCAGAT TCAT CT GCACAAATACGTGGTGGT CT ACT T TAGAGAGAT TGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGP SPEWMLQHDLT PGDLRDLRVEPVTTS VATGDY S I LMNVSWV 

LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 

NANMNEDGPSMSVNFTSPGCLDHIMKYKECKCVECAGSLWDPNITACKKNEETVEVNFTTTP 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 

SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 

QNHCRSEVILEKWQKKKIAEMGPVQWIJVTQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 

FNL FC S DL RS Q I HLHKY V W Y FRE I DT K DD Y N AL S VC P KY HLMKDAT A FC AELL H V KQQ VSAG KR S QAC H D 

GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 
- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 

AGCCACCAGCGCAAC ATGA CAGT GAAGACCCT GCATGGCCCAGCCAT GGTCAAGTAC TTGCT GC T 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
AC T C AGC T GAAGAAG 
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FIGURE 160 



MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCIMAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEKVLVTVGCTCVT PVIHHVQ 

Signal sequence : 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAG TGCAC AAAC TACCCAGCACAGCCCCC TCCGCC CCCT CTGGAGGC TGAAGAGGGAT TC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
T.GGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCC T GAAGATGAGGAAAAGT T TGGAGGAGCAGCTGACT CAGGGG TGGAGGAGC C TAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
f zk GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
C| ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
p GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
SJ CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
f\ CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
^ CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
% 4 GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
Jffjb TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
jf; AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
W * CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 

G TGGC TGTACTT CAC T ACC CAGCAAAGCC T C CACGAGGGCAGCT CGC CT TGGAGAGTAC T TAC T A 
C| CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
HI CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
|4 CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 

CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
W CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
P CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
HI CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 

GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACT TAAATAAAGGCAGAC GC T GT T T T T C TAAAAAAA 
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FIGURE 162 



MPVPWFLLSLALGRSPVVLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQECETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSWYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVXEFPLLKGHPNLCVQ 
VNSSEfCLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 



i --ivt. 

m 

ill 334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATCAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 

AAAC AT C C T GAC G T GG GAC AG C G G G C C AG AG G GC AC C C C AG AC AC G G T C T AC AGC AT C GAG TATA 

AGAC GT AC GGAGAGAGGGAC TGGG T GGCAAAGAAGGGCTG T CAGCGGATCACCCGGAAGTC C TGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

T TCTACCACTTAGAGC TCCAGGTCAACCGCACC TACCAAATGGACCTT GGAGGGAAGCAGAGAGA 

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

T CCACAGCGGCATAGCCT GTCC GAGAT CACCTACT TAGGGCAGC CAGACAT C TCCAT CC TCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA 

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC . 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 

GAAATGCAGG T CCACCAGGGAGGGAGACACACAAGCCT T T TC TGCAGGCAGGAGT TT CAGACCC T 

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 

AAAAAAAA 
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FIGURE 164 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPP.SGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 



APP ID- 
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FIGURE 165 



TGGCCTACTGGAAAAZ^AAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGGTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGG TGAC GATGTATATGT CT AC TACAAAC T GGCAGGGGGATCCC TT GAAC TT TGGGC T GGA 
AGTGT TGAACACAGT T TTGGATATTT TCCAAAAGATTTGATCAAGGTACT TCATAAATACACGGA 
AGAAGAGC TACATAT TCCAGCAGAT GAGACAGAC T T T GTC TGCT T T GAAGGAGGAAGAGATGAT T 
T T AAT AGT T ATAATGTAGAAGAGCT T TT AGGATC T T T GGAACT GGAGGACT CTGTACC TGAAGAG 
T CGAAGAAAGC TGAAGAAGT T T CTCAGCACAGAGAGAAAT C TC C T GAGGAGT C T CGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACC GAGGGGC T GCAGGGACAGCCCTCAGC TCAGGAGAGC CACC CTCACACCAGCGGT 
CCTGCGGCTAACGC T CAGGGAGT GCAGT CT TCGT TGGACAC T T T TGAAGAAATT CTGCACGATAA 
AT T GAAAG T GCCGGGAAGCGAAAGCAGAAC TGGCAATAG T TCTC CTGC CT C GGT GGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGT GGACAGTGCGTTAT T 
CAT TACAGCAAAGGAT T TCGT TGGCAT CAAAATC TAAGT T TGT T T T ACAAAGAT TGTT TT TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWOTPGQSDLSHGRRFS^ 

FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
.CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
y, ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
jg* TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
fi ATG^ACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
gf CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
y CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAG^CACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
|i ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
1|| GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
p GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
|!| AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
p TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCT^^ 
f|| AC T T CAAAC T T CAAG CAAAT GAAAT GGACAAT GCAGATAAAG T T GT TAT C AAC AC G T C GGGAG T A 

gj TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
H| TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTAT^TTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AAT^GAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
T T G T AATAAT G T AAC T T T G T T AAT AGG T GCAT AAACAC T AAT GC AGT C AAT T T GAAC AAAAGAAG 
TGACATACACAATATAAATCATATGTCT TCACACGT TGCCTATATAAT GAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AA^\AAAAA?VAAAAAAAAAAAAAAAAAAA?\AAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKV^ 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGD'FWIGLWRNGDGQTSGACPDLYQWS'DGSNSQ 
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPE INPTAPVEK 
PYLTNQPGDTHQNWVTEAG 1 1 PNL I YWI PT I PLLLL I LVAFGTCC FQMLHKSKGRTKTS PNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites, 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-269 

N-myristoylation sites, 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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